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Speed Up MPEG2 Encoding Process By Multiprocessor Computation 


1. Introduction 

Presently, most of the implementations of video encoder or decoder are based on 
VLSI technology. Many video compression and decompression chips have been designed 
and produced [3], [4], [6], [14]. VLSI implementation has the advantage of high 
performance, high efficiency, low power dissipation, and even low cost for large 
quantities. It is suitable for video telephone, video conference, and various multimedia 
applications. However, it also has the disadvantage of high design complexity, non- 
flexibility, and high cost if the application is not in large quantities. It is well known that 
video compression standards like P*64 [18], and MPEG [1], [2], [15] are unbalanced 
compression technology. The processing required at encoder side is more than that at the 
decoder side. The design of the encoder is more complex than that of the decoder. In 
addition, the MPEG standard is also unbalanced in terms of intended applications (CD- 
ROM, CTV, HDTV, etc ). That is, a large number of end users needs only a decoder to 
decode the compressed video. Encoder is needed only by a relatively small number of 
video producers. This further increases the cost of the encoder. For some applications, 
such as video telephony and video conferencing, where both encoder and decoder must 
present for the system to work, performance and efficiency requirement may make it 
necessary to implement the encoder and decoder, or codec, with VLSI technology. For 
applications intended by MPEG, it is not necessary to implement the encoder with VLSI 
technology. In this case, a multiprocessor implementation might be a good choice for 
encoder. 

The multiprocessor implementation has several advantages. The first advantage of 
this approach is its programmability. The implementation of this type is typically software 
programming on general purpose multiprocessor systems or symmetric multiprocessor 
(SMP) servers [5], [7], This is especially good for improving and updating the 
compression algorithm. Since digital video compression is a very young industry and the 
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compression standards (P*64, MPEG1, and MPEG2) only defined the syntax of the 
encoded video bitstream, much room has been left for designers to improve and enhance 
the encoder. Dedicated VLSI implementation may have the high risk in terms of system 
upgrading. Due to this reason, many manufacturers have taken the approach of 
programmable VLSI implementation. In this approach, compression algorithms are 
implemented by microcode downloaded to special purpose processors that are suitable for 
video signal processing. One example of this VSP (Video Signal Processing) approach is 
IIT’s VP (Vision Processor) and VC (Vision Controller) [11], and VCP (Vision 
Controller and Processor) [4], However, it is still not generic enough to implement 
compression algorithms other than DCT (Discrete Cosine Transform) based methods. The 
programmability with multiprocessor implementation dose not have this limit; hence, any 
updated algorithm can be implemented. This means different compression algorithms or 
even different applications can run on the same system. Since video compression and 
decompression are integrated parts of multimedia applications, the capability of sharing 
system resources will be helpful for reducing system cost. TTs MVP (Multimedia Video 
Processor) [5] is a new development in this respect. For MPEG2 standard, 
programmability also facilitates the flexible implementation of different profiles and levels. 

The second advantage of multiprocessor implementation is its scalability in terms 
of system performance. For the efficiency of signal processing, most available video 
compression algorithms process image data on the basis of uniform blocks. For instance, 
MPEG compresses video data macro block by macro block (MB). The processing of each 
MB is fully independent. Since a frame of video usually contains a large number of MBs, 
this provides a very good data partition basis for job assignment. If we denote the total 
number of available processors as N, and the number of MB’s of a frame of video as M, 
then N«M usually holds. The speed up ratio of using N processors over using 
uniprocessor is simply N if we divided/ MB’s uniformly among N processors (ignoring the 
communication overhead at this moment). This is significant because the value of N can 
vary in a relatively wide range and it can be fairly large. For applications like HDTV with 
high image resolutions, the amount of computation at the encoder side significant. A VLSI 
implementation that is designed for a predefined maximum data rate, whether 
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programmable or not, cannot process the data without degrading image quality (e.g. lower 
frame rate) if the data rate exceeds the designed one. It is impossible to increase the 
throughput by simply raising the chip’s working clock rate. A multiprocessor 
implementation, however, can increase the system throughput simply by adding enough 
number of processors. This scalability is also helpful for upgrading the system as the 
encoding algorithm getting more and more intelligent and thus requiring more 
computation power. New functions can also be added by simply adding more processors. 

The third advantage of multiprocessor implementation is that it reduces the 
implementation complexity. This is especially true for highly unbalanced MPEG2 [2] 
encoder. Once the target hardware system is available, the implementation is mainly a 
programming issue which is easier than the design and fabrication of a VLSI chip. If one 
starts with a uniprocessor program, then the parallel programming will be straight forward 
if the job assignment is based on data partition. 

The above argument is based on the assumption that a multiprocessor hardware 
environment is already available. Yes, a multiprocessor hardware might have the image of 
bulky and expensive. However, recent development has shown that multiprocessor 
processing is migrating toward desktop and server systems and even receiving attention 
from the embedded-system community [8], Small-scale systems with 4, 8, or 16 
processors are popular [9] The high computation requirement of multimedia applications 
has pushed the computation toward parallelism. The aforementioned MVP from TI is a 
typical example. It includes a 32-bit RISC master processor and four 32-bit DSPs. It aims 
at real-time multimedia applications: video conferencing, video compression and 
decompression for MPEG1 or 2, video server, 3D graphics, etc. [5], These developments 
in hardware architecture together with the above mentioned advantages make the 
multiprocessor implementation of video encoding algorithms a very appealing approach. 

In the following, we first present our evaluation of the modules of a MPEG2 
encoding process in terms of computation time in section 2. In particular, the two crucial 
modules, Motion Estimation (ME) and TRansform (TR) which includes DCT type 
estimation and forward DCT are analyzed in terms of parallel implementation. Section 3 
describes our multiprocessor implementation of the ME module and the TR module. This 
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is followed by the experimental results and analysis in section 4. Section 5 gives a 
comparison of MPEG2 and Indeo Video technologies [13], [20] based on our 
experiments. We summarize and discuss our results in section 6. 

2 Evaluation and Analysis of the Modules in MPEG2 Encoding Process 

For the purpose of performance evaluation, the computation time of each modules 
in MPEG2 encoding process was recorded by running the program that implements the 
MPEG2 encoding algorithm on a single-processor machine. The uncompressed video data 
we used for the evaluation was two clips of frames captured, respectively, from the two 
given video tapes, EFE Flight Experiment for air bubbles in a tank and glass tube 
(NASA1) and the Various Drop Physics Module test (NASA2). The frame resolution of 
the digital video is 256 by 240 pixels. Table 2.1 and table 2.2 list the computation time and 
its corresponding percentage for each module of the MPEG2 encoder. Figure 2. 1 and 
figure 2.2 further illustrate the percentage of computation load for each module. 

In tables 2. 1 and 2.2, ME stands for motion estimation module, Pred for prediction 
module, TR for DCT type estimation and forward DCT module, Q&VLC for quantization 
and variable length coding module, IQ for inverse quantization module, and IDCT for 
inverse DCT. From the tables and figures, it is clear that the crucial modules for the 
MPEG2 encoding process are the module ME and the module TR. This can also be 
recognized from the analysis of their computations. Figure 2.3 to 2.5 illustrate the flow 
charts of the module ME for frame pictures without using dual prime motion vectors 
(since dual prime motion vectors are only permitted for continuous P type pictures). 

As seen from figure 2.3, no motion estimation is carried out for I type pictures 
since they are to be coded with only the information of themselves. For P type pictures, 
motion estimation is carried out MB by MB for both frame coding mode and field coding 
mode. Motion estimation is to find out, by means of some search technique, the best 
match of the MB in a reference picture. The search is usually limited within a window 
centered around the location of the MB. This best match is determined by the minimum 
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Table 2. 1 Time and percentage by modules of MPEG2 encoding process (NASA1) 


Functional 

I type pictures 

P type pictures 

B type pictures 

Modules 

Time(sec) 

Percentage 

Time(sec) 

Percentage 

Time(sec) 

Percentage 

ME 

0.05 

1.2% 

10.756 

75.8% 

8.614 

72.3% 

Pred 

0 

0.0% 

0.046 

0.3% 

0.0825 

0.7% 

TR 

2.609 

66.1% 

2.326 

16.4% 

2.335 

19.6% 

Q&VLC 

0.851 

21.5% 

0.668 

4.7% 

0.567 

4.8% 

IQ 

0.165 

4.2% 

0.165 

1.2% 

0.101 

0.8% 

IDCT 

0.275 

7.0% 

0.229 

1.6% 

0.218 

1.8% 


Table 2.2 Time and percentage by modules of MPEG2 encoding process (NASA2) 


Functional 

I type pictures 

P type pictures 

B type pictures 

Modules 

Time(sec) 

Percentage 

Time(sec) 

Percentage 

Time(sec) 

Percentage 

ME 

0.028 

0.75% 

7.754 

69.7% 

7.012 

68.2% 

Pred 

0.028 

0.75% 

0.037 

0.4% 

0.078 

0.7% 

TR 

2.397 

60.4% 

2.307 

20.7% 

2.321 

22.6% 

Q&VLC 

0.934 

24.5% 

0.659 

5.9% 

0.540 

5.3% 

IQ 

0.192 

5.0% 

0.138 

1.2% 

0.110 

1.1% 

IDCT 

0.33 

8.6% 

0.229 

2.1% 

0.220 

2.1% 


distance defined as summed absolute differences between the pixel values of the MB and 
those of the reference picture and is sometimes called the prediction of the MB from the 
reference picture. The corresponding displacement between the MB and its best match, 
called Moving Vector (MV), can be used for motion compensation during decoding. The 
search process is a computational intensive process. To get better results by comparison, 
this search process is repeated for frame coding mode and various prediction combinations 
in field coding mode as shown in figure 2.4. This repetition gravitates the computation 
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load. The worst situation, however, happens when doing motion estimation for B type 
pictures. In this case, two reference pictures are to be searched, corresponding to forward 
prediction and backward prediction respectively as shown in figure 2.5. These two blocks 
each includes the computations corresponding to the blocks encircled by the dashed 
rectangle in figure 2.4. The symbols dmc, dmcftt, dmcftb, dmcft, dmcfbt, dmcfbb, dmcfb. 



ME Pred TR Q&VLC IQ IDCT 

M od u les 


Fig. 2.1 Chartforpercentageoftime spentby 
d ifferent mod ules 
with video clip NASA1 



M od u les 

Fig. 2.2 Chartfor percentage oftime spent by 
d ifferen t modules 
with video clip NASA2 
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[Start of motion estimation ] 



Fig. 2.3 Flow chart for motion estimation 


and dmcf in figure 2.4 and the symbols dmc_f, dmc_bi, dmcf_bi_t, dmcfbib, dmcfbi, 
and dmcf_b in figure 2.5 are the minimum distances evaluated at various estimation 
modes. 

To implement the function of motion estimation is more than just searching for the 
best match of the MB and getting the MVs. As seen from figures 2.4 and 2.5, motion 
estimation also includes the estimation of the variance of the MB ( Var in the figures 2.4 
and 2.5), the summed squared motion compensated prediction error ( Smcp in the figures 
2.4 and 2.5), and the summed squared prediction error without motion compensation ( Sp 
in the figures 2.4 and 2.5). Decisions about the MB types such as intra frame or field (see 
figure 2.4 and 2.5), prediction with motion compensation (see figure 2.4 and 2.5), and 
prediction without motion compensation (see figure 2.4) are made based on the 
comparison of these evaluations. From the point of view of computation, however, most 
computation load comes from the search processes. 

For the TR module the computation is straightforward. The computation load 
mainly comes from the six 8 x 8 double precision floating point multiplications and 
additions carried out in the forward DCT part for each MB. 
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Fig. 2.4 Flow chart for motion estimation of P type pictures 
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Fig. 2.5 Flow chart for motion estimation of B type pictures 
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Parallelism can be easily identified at various levels within the two modules. Take 
ME module for example, the six functional blocks at the first stage of the flow chart in the 
case of P type pictures in figure 2.4 can be computed in parallel. In the case of B type 
pictures in figure 2.5, the first stage of computation has more blocks that can be computed 
in parallel (remember each of the two prediction blocks in figure 2.5 contains the 
functional block encircled by the dashed lines in figure 2.4). Parallelism can also be 
observed if we go further into the lower levels. The exploitation of this kind of parallelism 
is called program partitioning [12] However, caution must be taken when one tries to 
utilize the parallelism because the results from different parallel tasks must merge together 
to make comparisons, which implies that some kind of data communication will be 
required. If, with an implementation, the data communication takes more time than the 
computation does, this implementation is not useful in terms of speeding up the process. 

To benefit from a specific parallel implementation of an algorithm with parallelism, 
one must carefully divide the task such that the job assignment well fits the underlying 
implementation. For the module ME we have tried to parallelize it by assigning the 
functional blocks (see figure 2.4 and 2.5) that can be concurrently computed to different 
processors of a multiprocessor computation machine. It turned out, however, that the total 
time spent with multiple processors was even more than that with single processor. This 
implied that the Inter Processor Communication (IPC) time spent was more than the 
computation time. The granularity of this program partitioning does not fit the 
computation power of the individual processor of the multiprocessor system. Even if the 
granularity well fits the computation power, one could not expect a high speedup with the 
limited parallelism in the module ME. This is clear from figures 2.4 and 2.5. It is not 
difficult to see that similar conclusion could also be made if we tried to parallelize the TR 
module by program partitioning. The above analyses suggest that program partitioning is 
not a good way of parallelizing the ME and TR modules. 
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3. 


Multiprocessor Implementation of The ME module and TR module 


Based on the discussion in previous sections, we chose to study the multiprocessor 
implementation of MPEG2 encoding algorithm. Because of their dominance in 
computation time (see Figs. 2. 1 and 2.2), we only considered the two crucial modules, 

ME and TR, in our study. 

The study is carried out on a multiprocessor computation machine built with 
INMOS T805 processors on an IBM PC compatible host computer. The processing speed 
of the processor is 25 MHz. It has 4 bi-directional links that can be used to connect it with 
other processors. The bandwidth of each link is 20 M bits/s. The processes on a processor 
may communicate with the processes on other processors by sending and receiving 
messages through the corresponding links. Extreme care must be taken when one 
programs the communication because a miss match in the type or the number of 
transmission unit would result in a deadlock. All the communications with the outside 
world must be done through a root processor that is connected with the host computer. 
This makes it difficult to debug the programs on all the processors other than the root 
processor. The program can be written in C or FORTRAN. In this study we used the 
Parallel C Version 2.2.2 by 3L Ltd. 

In order to gain a high speedup, we did our parallel implementation by data 
partitioning. That is, to have the processors run the same program but process different 
parts of a whole data set. This is possible because the compression algorithm processes a 
frame of video MB by MB independently. As discussed in section 1, dividing MBs among 
processors will result in high speed up and good scalability. In addition, it also contributes 
to the development and maintainability of the parallel program since the programs on most 
of the processors would be much the same except for the communication parts. 

Specifically, we divided M MBs uniformly among N processors. The picture 
resolution used for this study was 256 by 240. A MB is a square block of 16 by 16 pixels. 
This results in a total number of MBsM= 256. We did experiments with different values 
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a. N - 3; a processor processes b. N = 7; a processor processes 1/7 of the MBs in a frame. 

1/3 of the MBs in a frame. 


Fig. 3.1 Configurations of porcessors for different values of N. 

Each processor runs the same algorithm but processes 
1/N part of a frame of image data. 



Fig. 3.2 Computation evironment for parallel 

implementation of the modules ME and TR. 


of N. For N= \, a processor would process 256 MBs. For N=3, two of the processors 
would process 85 MBs and one of them processes 86. For N = 7, 3 of them process 36 
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MBs and 4 of them process 37. Due to the same reason, the parallelization for module TR 
was also done by data partitioning in the same way. 

The MPEG2 standard was designed to deal with both frame type pictures and field 
type pictures. To simplify parallel programming and debugging, we only considered frame 
type pictures in our study on the parallel implementation. However, we included the field 
coding mode for frame type pictures in the study. 

The processors were configured in binary tree structures. Figure 3.1 shows the 
configurations of the processors and the corresponding data partitioning for each frame of 
image when A^= 3 and 7. 

The code running on each processor is the same as the corresponding part of the 
uniprocessor program except for the instructions for data communication. The root 
processor, however, has some extra code for input and output purpose. Since the 
implementation is only for two modules of the encoder, the parallel program could not run 
independently without a proper “environment”. To provide the necessary environmental 
data for the two modules, files were used to record the necessary data for the two 
modules from the running uniprocessor program. When the parallel program is running, it 
reads the data from the files and write its results into other files which are read in by the 
uniprocessor program to verify the correctness of the parallel implementation. This 
process is illustrated by the diagram shown in figure 3.2. 

For the ME module, several reference frames (only the luminance Y component) 
have to be sent through the communication links from the root processor to each internal 
processor so that the motion estimation could be carried out properly. For TR module, 
two frames of Y, U, and V components is also to be sent to each processor to carry out 
the DCT type estimation as well as DCT transform. In addition, results from internal 
processors must be sent through the links to the root processor. As will be seen in the next 
section, these constituted a large amount of communication. 
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4. 


Experimental Results 


The experiments on the parallel implementation of the two modules of ME and TR 
were carried out with two clips of video and each contains 20 frames of pictures captured 
from the two given tapes, EFE Flight Experiment for air bubbles in a tank and glass 
tubes (NASA1) and the Various Drop Physics Module test (NASA2), at the resolution 
of 256 by 240. The frame rate that could be reached at this resolution is about 5 frames 
per second (fps). Due to memory limitation, long period video clips with large number of 
frames of relatively high resolutions could only be captured at relatively low frame rate so 
that the captured image data could be written to the extended memory by means of DOS 
RAMDRIVE function. 

The communication times and computation times for different number of 
processors were recorded for both clips. These are listed in Tables 5. 1-5.4 In the tables 
the N is the total number of processors used in the computation. The recorded times under 
the column of N = 1 are the computation times that the corresponding module spent for 
computation with single processor implementation. Thus the data under this column is 
served as the basis for the evaluation of speed up with multiprocessor implementations. 
The observed processing times under the columns of Total for N = 3 and N = 7 are the 
times that the module spent for both computation and communication. To analyze the 
effect of communication time on the speed up, the computation times and the 
communication times were also recorded. However, care must be taken in interpreting 
these times because they were only the observed ones at the root processor. For example, 
the total processing time may not be equal to the sum of computation time and the 
communication time due to the overlaps between the computation time and the 
communication time at different processors. From these tables following observation can 
be made: 

1 ) The computation time decreases with the increase of processor number N since 
the job assigned to each individual processor is getting less with larger values of N. On the 
other hand, the communication time increases with the increase of processor number N 
since the communication is more with larger values of N. 


14 


Table 4. 1 Observed times for module ME with video clip NASA1 
Frame # Observed Times (in 64 micro second) 

& N = 1 I N = 3 I N = 7 

Type Comp. Comp. Comm. Total Comp. Comm. Total 

0/1 9298 3758 353 4H9 1594 456 2052 

1/B 1726499 608208 13666 623555 222634 87616 310261 

2/B 1667899 564497 25226 589733 223656 60216 283883 

3/P 2176734 761409 20162 781586 266318 126774 393106 

4/B 1704901 594757 14331 610033 222658 75855 298521 

5^ 1709886 605068 13666 620390 227556 71925 299494 

6/P 2245627 819969 8340 829329 282689 129278 411974 

7/B 1681560 580457 18730 599201 220872 70019 290976 

~m 1712448 607102 13666 622436 223284 84928 308224 

9/P 2158411 780529 8340 789916 290949 93699 384662 

10/B 1685779 597337 13666 612623 223260 77911 301182 

11/B 1740238 604031 17806 621843 223977 84129 308115 

12/P 2071705 737668 8350 747027 277342 89206 366561 

13a 1715725 603100 13666 618391 217243 93271 310525 

14a 1672303 580850 13666 596138 222976 77022 290010 

15/1 9297 3761 353 4U5 1598 457 2051 

16a 1707198 608378 13666 623692 224936 79215 304159 

17a 1703552 609250 13666 624568 226959 77586 304556 

18a 2245690 807575 8340 816961 282624 192532 412759 

19a 2161123 789158 7351 795845 280270 107748 388028 
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Table 4.2 Observed times for module TR with video clip NASA 1 


Frame # 

Observed Times (in 64 micro second) 

& 

N = 1 

N = 3 

z 

II 

Type 

Comp. 

Comp. 

Comm. 

Total 

Comp. 

Comm. 

Total 

0/1 

212095 

73368 

20468 

93836 

29805 

37591 

67457 

1/B 

210362 

72599 

20468 

93067 

29490 

37658 

67210 

2/B 

210290 

72511 

20467 

92977 

29474 

37623 

67157 

3/P 

210471 

72667 

20468 

93134 

29538 

37608 

67207 

4/B 

210367 

72611 

20468 

93078 

29498 

37640 

67199 

5/B 

210331 

72576 

20468 

93043 

29495 

37621 

67179 

6/P 

210616 

72702 

20467 

93169 

29533 

37645 

67241 

7/B 

210412 

72630 

20468 

93096 

29511 

37581 

67201 

8/B 

210354 

72597 

20467 

93066 

29489 

37646 

67198 

9/P 

210486 

72662 

20468 

93128 

29549 

37581 

67193 

10/B 

210234 

72558 

20467 

93027 

29479 

37645 

67186 

1 1/B 

210316 

72579 

20467 

93045 

29503 

37613 

67177 

12/P 

210368 

72624 

20467 

93090 

29538 

37575 

67174 

13/B 

210193 

72568 

20467 

93035 

29496 

37620 

67177 

14/B 

210140 

72558 

20468 

93025 

29504 

37597 

67162 

15/1 

212089 

73365 

20468 

93831 

29806 

37594 

67462 

16/B 

210192 

72568 

20468 

93036 

29494 

37722 

67178 

17/B 

210316 

72613 

20468 

93083 

29533 

37586 

67179 

18/P 

210424 

72674 

20467 

93142 

29564 

37589 

67214 

19/P 

210631 

72746 

20468 

93213 

29570 

37595 

67228 
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Table 4.3 Observed times for module ME with video clip NASA2 


Frame # 

Observed Times (in 64 micro second) 

& 

N = 1 

N = 3 

N = 7 

Type 

Comp. 

Comp. 

Comm. 

Total 

Comp. 

Comm. 

Total 

0/1 

11036 

3759 

353 

4117 

1594 

457 

1757 

1/B 

1329425 

411151 

106898 

518058 

156552 

91181 

247333 

2/B 

1329300 

411504 

r 99104 

510619 

159552 

86394 

245230 

3/P 

1532219 

462179 

171375 

633565 

166616 

138722 

305144 

4/B 

1313115 

427000 

79853 

506863 

167279 

78029 

244909 

5/B 

1409249 

460962 

82179 

543155 

175841 

88539 

263981 

6/P 

1584186 

507640 

155232 

662882 

179109 

141350 

320264 

7/B 

1319275 

417844 

93692 

511624 

158928 

91933 

250537 

8/B 

1336928 

415553 

109288 

524848 

162610 

86055 

248266 

9/P 

1500753 

463243 

149203 

612459 

166069 

131378 

297251 

10/B 

1292721 

597337 

99308 

505557 

155853 

86208 

241662 

1 1/B 

1350369 

420413 

104466 

524889 

159085 

93553 

252246 

12/P 

1437746 

416761 

187726 

604500 

158214 

129354 

286375 

13/B 

1357313 

429484 

87307 

516800 

161506 

84393 

245501 

14/B 

1284958 

399774 

93883 

493666 

155391 

82502 

237497 

15/1 

11036 

3762 

353 

4113 

1590 

457 

1758 

16/B 

1361928 

445798 

60995 

506805 

168397 

86858 

254857 

17/B 

1297323 

396599 

123732 

520341 

156135 

87560 

243193 

18/P 

1502195 

466425 

152996 

619431 

165749 

146695 

312246 

19/P 

1600491 

509835 

116726 

626571 

181291 

118312 

f 299410 
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Table 4.4 Observed times for module TR with video clip NASA2 


Frame # 


( 

Observed T 

imes (in 64 micro second) 


& 

N = 1 

N = 3 

N = 7 

Type 

Comp. 

Comp. 

Comm. 

Total 

Comp. 

Comm. 

Total 

0/1 

212109 

73352 

20467 

93817 

29772 

37697 

67479 

1/B 

211025 

72967 

20468 

93436 

29667 

37651 

67330 

2/B 

211007 

72963 

20467 

93431 

29671 

37636 

67317 

3/P 

211190 

73033 

20468 

93502 

29676 

37674 

67360 

4/B 

211031 

72971 

20467 

93440 

29679 

37633 

67320 

5/B 

211029 

72975 

20468 

93441 

29679 

37634 

67324 

6/P 

211153 

73014 

20468 

93481 

29683 

37653 

67349 

7/B 

211105 

73009 

20468 

93475 

29690 

37562 

67335 

8/B 

211082 

72993 

20468 

93460 

29690 

37627 

67328 

9/P 

211215 

73063 

20468 

93532 

29696 

37640 

67358 

10/B 

211161 

73044 

20467 

93513 

29690 

37655 

67356 

1 1/B 

211117 

73005 

20467 

93470 

29689 

37636 

67335 

12/P 

211205 

73023 

20469 

93492 

29692 

37642 

67346 

13/B 

211126 

73032 

20467 

93500 

29680 

37663 

67357 

14/B 

211100 

73001 

20467 

93468 

29679 

37648 

67336 

15/1 

212128 

73361 

20468 

93829 

29774 

37697 

67479 

16/B 

211067 

72963 

20468 

93431 

29682 

37624 

67315 

17/B 

211043 

72970 

20468 

93438 

29680 

37628 

67318 

18/P 

211235 

73067 

20468 

93535 

29687 

37669 

67367 

19/P 

211178 

73045 

20467 

93511 

29695 

37651 

67356 
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2) For the module ME, obvious variations are observed with different picture 
types. The computation time spent with I type pictures is very small because no motion 
estimation is carried out for them. The computation time with P type pictures is even more 
than that with B type pictures although the computation is more complex with B type 
pictures than with P type ones. This is because the search window with B type pictures is 
smaller. 

3) For the module TR, no obvious variation can be observed with different picture 

types. 

The variations in computation time and communication time with different number 
of processors are further illustrated by the plots shown in figures 4. 1 to 4.4 for video clip 
NASA1, and figures 4.7 to 4.10 for video clip NASA2. It can be seen from these figures 
that as the number of processors grows, the decrease of the observed total processing time 
slows down due to the contribution of the growing communication time. This made the 
observed speedup tend to saturate with the increase of the processor number as illustrated 
by the plots shown in figures 4.5 and 4.6 for video clip NASA1, and figure 4. 1 1 and 4. 12 
for video clip NASA2. From these plots of speedup vs. processor number it is also seen 
that a higher Computation time to Communication time Ratio (CCR) as seen in the case of 
module ME has a higher speed up ratio than a lower CCR does as seen in the case of 
module TR. 

MPEG standards provide means of controlling compressed data rate. A data rate 
corresponds to a quality of reconstructed video and a compression ratio. The higher the 
data rate, the higher the quality of the reconstructed video but the lower the compression 
ratio and vice versa. Table 5.1 lists the compression ratio and video quality description 
corresponding to different data rate. The sample reconstructed images of the two video 
clips are also shown in figures 5.1 through 5.6. 
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I type pictures 




N 


Fig. 4.1 Observed times for module ME with video clip 
NASA1 

(time unit = 64 microseconds). 


P type pictures 



N 

Fig. 4.2 Observed times for module ME with video clip NASA1. 
(time unit = 64 microseconds) 
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N 


Fig. 4.5 Speed up ratio for module ME with video clip NASA1. 


averaged 
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I type pictures 



N 

Fig. 4.7 Observed times for module ME with video clip NASA2. 
(time unit = 64 microseconds) 


P type pictures 



N 

Fig. 4.8 Observed times for module ME with video clip NASA2. 
(time unit = 64 microseconds) 
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Speedup ratio 


Speedup ratio 



N 

Fig. 4.11 Speed up ratio for module ME with video clip NASA2. 


Averaged 


3.5 j 
3 -- 

2.5 -- 
2 -- 

1.6 -- 
1 -■ 

0.6 -- 

0 -I 1 1 1 lit! 

0 1 2 3 4 5 6 7 

N 

Fig. 4.12 Speed up ratio for module TR with video clip NASA2. 
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5. 


Comparison of MPEG2 with Indeo Video 


MPEG and Indeo Video are two widely used video compression techniques. We 
have studied Indeo Video technology during this contract [13], [20], In the following we 
will compare these two technologies in terms of compression ratio and reconstructed 
image quality. Table 5.1 and table 5.2 list the statistics we obtained through our 
experiments. The corresponding samples of the reconstructed as well as the corresponding 
original images with both technologies are shown in figures 5. 1 through 5.6 for MPEG2 
and figures 5.7 through 5.12 for Indeo Video respectively. 

From the statistics shown in the tables and the qualities of the images shown in the 
figures, we can see that MPEG produced better image qualities than the Indeo Video did. 
Severe artifacts pop up with Indeo Video compression at moderate compression ratios 
where no artifacts are visible with MPEG compression. The MPEG compression produced 
images with reasonable qualities at fairly high compression ratios where Indeo Video 
compression produced unrecognizable images. Our experiments suggested that the MPEG 
is superiors to Indeo Video in terms of compression ratio and reconstructed image quality. 
In addition, MPEG is a MB oriented coding which facilitates data partitioning that leads to 
good scalability. Another important issue is that MPEG is an international standard that is 
well expected to be widely accepted by the world 
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Table 5. 1 Compression ratios and qualities at different data rates with MPEG2. 


Data rate 
(Mbps) 

NASA1 

NASA2 

Compression 

ratio 

Quality 

description 

Compression 

ratio 

Quality 

description 

4 

5.53 

very good 

5.63 

very good 

2 

11.05 

good 

11.04 

good 

1 

22.06 

invisible block artifact 

22.05 

invisible block artifact 

0.5 

44.17 

visible block artifact 

43.77 

visible block artifact 

0.1 

65.40 

obvious block artifact 

85.44 

obvious block artifact 


Table 5.2 Compression ratios and qualities at different quantization levels 


with Indeo Video. 


Quatiza- 

NASA1 

NASA2 

tion 

Compression 

Quality 

Compression 

Quality 

level 

ratio 

description 

ratio 

description 

2 

6.39 

very good 

5.58 

very good 

8 

8.83 

visible artifact 

7.26 

visible artifact 

16 

10.14 

very bad 

13.76 

very bad 

18 

22.77 

very bad 

22.10 

very bad 
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a. (Compression ratio = 0, original ) b. (Compression ratio = 5.53) 



e. (Compression ratio = 44. 17) f. (Compression ratio = 65.40) 

Fig. 5.1 Original & reconstructed frames for an I type picture from NASA1 


with MPEG2. 
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c. (Compression ratio = 1 1 .05) d. (Compression ratio = 22.06) 








c. (Compression ratio = 1 1.05) d. (Compression ratio = 22.06) 



e. (Compression ratio = 44. 17) f. (Compression ratio = 65.40) 


Fig. 5.3 Original & reconstructed frames for a P type picture from NASA1 
with MPEG2. 
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a. (Compression ratio = 0, original) b. (Compression ratio = 5.63) 



c. (Compression ratio = 1 1 .04) d. (Compression ratio = 22.05) 

■m wmmsam ik m wmmm 



e. (Compression ratio = 43.77) f. (Compression ratio = 85.44) 


Fig. 5.4 Original & reconstructed frames for an I type picture from NASA2 
with MPEG2. 
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a. (Compression ratio = 0, original) b. (Compression ratio = 5.63) 



c. (Compression ratio = 1 1 .05) d. (Compression ratio = 22.05) 



e. (Compression ratio = 44. 17) f. (Compression ratio = 85.44) 

Fig. 5.5 Original & reconstructed frames for a B type picture from NASA2 


with MPEG2. 
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c. (Compression ratio = 1 1.04) 



d. (Compression ratio = 22.05) 



e. (Compression ratio = 43.77) f. (Compression ratio = 85.44) 

Fig. 5.6 Original & reconstructed frames for a P type picture from NASA2 


with MPEG2. 
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a. (Compression ratio = 0, original) 


b. (Compression ratio = 6.39) 







a. (Compression ratio = 0, original) b. (Compression ratio = 6.39) 



with Indeo Video. 


35 





c. (Compression ratio = 8.83) d (Compression ratio =10.14) 



e. (Compression ratio = 22.77) 


Fig. 5.9 Original & reconstructed frames for a relative picture from NASA1 


with Indeo Video. 
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a. (Compression ratio = 0, original) b. (Compression ratio = 5.58) 



c. (Compression ratio = 7.26) d. (Compression ratio = 13.76) 



e. (Compression ratio = 22.10) 

Fig. 5.10 Original & reconstructed frames for a reference picture from NASA2 
with Indeo Video. 
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a. (Compression ratio = 0, original) b (Compression ratio = 5.58) 



c. (Compression ratio = 7.26) d. (Compression ratio = 13.76) 



e. (Compression ratio = 22.10) 


Fig. 5.11 Original & reconstructed frames for a relative picture from NASA2 
with Indeo Video. 
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a. (Compression ratio = 0, original) b. (Compression ratio = 5.58) 



c. (Compression ratio = 7.26) d. (Compression ratio = 13.76) 



e. (Compression ratio = 22.10) 

Fig. 5.12 Original & reconstructed frames for a relative picture from NASA2 
with Indeo Video. 
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6 . 


Summary and Discussions 


In this report, we evaluated the different modules of the MPEG2 encoder 
algorithm with two clips of video each containing 20 frames of pictures captured from the 
two given types, EFE Flight Experiment for air bubbles in a tank and glass tubes 
(NASA1) and the Various Drop Physics Module test (NASA2), at the resolution of 256 
by 240. The measured times reconfirmed that the two crucial modules are the ME module 
and the TR module that take more than 90 % of the total computation time of the MPEG2 
encoding process. 

Approaches to implementing video standards were analyzed in terms of 
complexity, applications, and cost. It was argued that, for applications intended by MPEG 
multiprocessor implementation might be a good choice for encoder because it is highly 
unbalanced both in terms of computation and in terms of application as well. This 
approach has the advantages of programmability, good scalability for job assignment by 
data partitioning, and less design complexity. 

Experiments were carried out with the parallel implementation of the two crucial 
modules of the encoding process. We adopted a strategy of data partitioning for job 
assignment. Instead of partitioning algorithms that exhibits limited parallelism, we divided 
the picture frames uniformly on the basis of MBs which are a basic coding unit of the 
major image/video compression standards such as JPEG [16], [19], H.261 [21], as well as 
MPEG1[1] and MPEG2 [2], This type of job partitioning also made our parallel 
programming easier. The most benefit of this type of data partitioning is that good 
scalability can be achieved as seen from the previous section. 

Compared with Indeo video technology [13 ], [20], MPEG can achieve higher 
compression ratio at the same reconstructed video quality as the Indeo video. In addition, 
its MB oriented coding facilitates data partitioning that leads to good scalability. Finally, 
MPEG is an international standard that is well expected to be widely accepted. 


40 


7. 


References 


[1] “ISO/IEC 1 1 172-2: Coding of moving pictures and associated audio for digital 
storage media at up to about 1.5 Mbits/s - Part 2: Video”, International Standards 

Organization, August 93. 

[2] “ISO/IEC DIS 13818-2: Generic coding of moving pictures and associated audio 
information - Part 2: Video ”, International Standards Organization, May 94. 

[3] Thierry Fautier, “VLSI Implementation of MPEG Decoders”, in Proc. IEEE 
ISCAS 94 Tutorials, May 1994. 

[4] Bryan Ackland, “VLSI Architectures for Multimedia and Video Conferencing”, 
in Proc. IEEE ISCAS 94 Tutorials, May 1994. 

[5] Dave Bursky, “Parallelism Pushes DSP Throughput”, Electronic Design Vol.42, 

No. , Mach 21, 1994, pp-15 1-154. 

[6] Peter Pirsch, “VLSI Architectures for Video Compression — A Survey ”, Proc. 
IEEE, Vol. 83, No. 2, Feb. 1995, pp 220-246. 

[7] Frans Sijstermans and Jan Van Der Meer, “CD-I Full-Motion Video Encoding on a 
Parallel Computer”, Comm. ACM, Vol. 34, NO. 4, April 1991, pp 82-91. 

[8] Jack Shandle, “Parallel Processing Comes Down to Earth ”, Electronic Design 
Vol. 41, No. 17, Aug. 19, 1993, pp 56-62. 

[9] Dan Strassberg, “To Multiprocess or not To Mulitprocess ?”, EDN, June 23, 

1994, pp 64-72. 

[10] Jeffrey Child, “Realtime Video Compression Poses Challenges to designers and 
Vendors Alike”, Computer Design, Vol. 32, No. 7, July 1993, pp 67-82. 

[11] Doug Bailey, “The Realtime Virtues of Programmability ”, Computer Design, 

Vol. 32, No. 7, July 1993, pp 71-76. 

[12] Kai Hwang, Advanced Computer Architecture: Parallelism, Scalability, 
Programmability, McGraw-Hill, Inc., 1993. 


41 


[13] Quaterly report for this contract, 06-16-94 to 09-15-94. 

[14] Peter A. Ruetz, Po Tong, Douglas Bailey, Daniel A. Luthi, and Peng H. Ang, 

“A High-Performance Full-Motion Video Compression Chip Set ”, IEEE Trans. 
Circuits Syst. Video Technol., Vol. 2, No. 2, June 1992. 

[ 1 5] Rangarahan Aravind, Glenn L. Cash, Dnald L. Duttweiler, Hsueh-Ming Hang, 
Barry G. Haskell, and Atul Puri, “Image and video Coding Standards”, AT&T 
Technical Journal, January/February 1993, pp 67-88. 

[16] Gregory K. Wallace, “The JPEG Still Picture Compression Standard”, 
Communications of The ACM, Vol. 34, No. 4, April 1991, pp 31-45. 

[17] Didier Le Gall, “MPEG: A Video Compression Standard for Multimedia 
Applications”, Communications of The ACM, Vol. 34, No. 4, April 1991, 
pp 47-58. 

[ 1 8] Ming Liou, “Overview of the P*64 kbits/s Video Coding Standard”, 
Communications of The ACM, Vol. 34, No. 4, April 1991, pp 60-63. 

[ 1 9] William B. Pennebaker, and Joan L. Mitchell, JPEG Still Image Data 

Compression Standard, Van Nostrand Reinhold, 1993. 

[20] Mark J. Bunzel, and Sandra K. Morris, Multimedia Applications Development 
Using Indeo m Video and DVI Technology , McGraw-Hill, Inc., 1994. 

[21] S. Okubo, “Reference Model Methodology — A Tool for the Collaborative 

Creation of Video Coding Standards”, ”, Proc. IEEE, Vol. 83, No. 2, Feb. 1995, 
pp 139-150+. 


42 


Appendix C Code Programs That Were Used In the Experiments 


This appendix includes the programs that we used for our experiments. The 
parallel implementation was based on uniprocessor programs that were adapted from 
those developed by the MPEG Software Simulation Group. Part one is for module ME 
and part two for module TR. Each part includes 3 sets of programs. The first one runs on 
personal computers with 32 bit C code compiler. It is for providing data for the second set 
of programs that is a parallel implementation of module ME and runs on INMOS T805 
based parallel computers with 3L Ltd.’s parallel C compiler version 2.2.2. Like the first set 
of programs, the last set runs on personal computers with 32 bit C compiler. It reads data 
produced by the second set of programs and produces compressed data that is MPEG2 
compatible and can be decoded by any MPEG2 decoding program. The purpose of the last 
set of programs is thus for verifying the correctness of the parallel program that 
implements the ME module. 

The part one has following three sets of programs: 

1 . The first set of programs for providing data. It includes programs: 
encoder.cpp, putseql.cpp, motion. cpp, transfrm.cpp, and putpic.cpp. 

2. The second set of programs for parallel implementation of module ME with 7 
processors. It includes programs: mO.c, mOl.c, m02.c, mOll.c, m012.c, m021.c, 
and m022.c 

3. The third set of programs for verifying the parallel programs implementing module 
ME. It includes programs: encoder.cpp, putseq2.cpp, transfrm.cpp, and 
putpic.cpp 

The part two has following three sets of programs: 

1 . The first set of programs for providing data. It includes programs: 
encoder.cpp, putseq4.cpp, motion.cpp, transfrm.cpp, and putpic.cpp. 

2. The second set of programs for parallel implementation of module TR with 7 
processors. It includes programs: tO.c, tOl.c, t02.c, tOll.c, t012.c, t021.c, and 
t022.c. 



3 The third set of programs for verifying the parallel programs implementing module 
TR. It includes programs: encoder.cpp, putseq5.cpp, motion.cpp, and 
putpic.cpp. 


MPEG-2 Test Sequence, 30 frames/sec /* parameter file for running encoding programs */ 

ns2_%02d /* name of source files *1 

q%d /* name of reconstructed images don't store) */ 

/* name of intra quant matrix file default matrix) */ 

/* name of non intra quant matrix file default matrix) */ 
stat.out /* name of statistics file stdout ) */ 

0 /* input picture file format: 0=*.Y,*.U,*.V, l=*.yuv, 2=*.ppm */ 

20 /* number of frames */ 

0 /* number of first frame */ 

00 .00:00:00 /* timecode of first frame *1 
15 /* N (# of frames in GOP) */ 

3 /* M (I/P frame distance) */ 

0 /* ISO/IEC 1 1 172-2 stream */ 

0 /* 0:frame pictures, l:field pictures */ 

256 /* honzontal_size */ 

240 /* vertical_size */ 

2 /* aspect_ratio_information l=square pel, 2=4:3, 3=16:9, 4=2.11:1 */ 

5 /* frame_rate_code 1=23.976, 2=24, 3=25, 4=29.97, 5=30 frames/sec. */ 

4000000.0 /* bit_rate (bits/s) */ 

1 12 /* vbvbuffersize (in multiples of 16 kbit) */ 

0 /* low_delay */ 

0 /* constrained_parameters_flag */ 

4 /* Profile ED: Simple = 5, Main = 4, SNR = 3, Spatial = 2, High = 1 *! 

8 /* Level ID: Low = 10, Main = 8, High 1440 = 6, High = 4 */ 

0 /* progressive sequence */ 

1 /* chroma_format: 1=4:2:0, 2=4:2:2, 3=4:4:4 */ 

2 /* video_format: 0=comp., 1=PAL, 2=NTSC, 3=SECAM, 4=MAC, 5=unspec. */ 

5 /* color_primaries */ 

5 /* transfer characteristics */ 

4 /* matrix coefficients */ 

256 /* display_horizontal_size */ 

240 /* display_vertical_size */ 

0 /* intra_dc_precision (0: 8 bit, 1 : 9 bit, 2: 10 bit, 3: 1 1 bit */ 

1 I* top_field_first */ 

0 0 0 /* frame_pred_frame_dct (I P B) */ 

0 0 0 /* concealment_motion_vectors (I P B) */ 

111 /* q_scale_type (I P B) */ 

10 0 /* intra vlc format (I P B)*/ 

0 0 0 /* altemate scan (I P B) */ 

0 /* repeat_first_field */ 

0 /* progressive_frame */ 

0 /* P distance between complete intra slice refresh */ 

0 /* rate control: r (reaction parameter) */ 

0 /* rate control: avg act (initial average activity) */ 

0 /* rate control: Xi (initial 1 frame global complexity measure) */ 

0 /* rate control: Xp (initial P frame global complexity measure) */ 

0 /* rate control: Xb (initial B frame global complexity measure) */ 

0 /* rate control: dOi (initial I frame virtual buffer fullness) */ 

0 /* rate control: dOp (initial P frame virtual buffer fullness) */ 

0 /* rate control . dOb (initial B frame virtual buffer fullness) */ 

2 2 11 1 1 /* P: forw_hor_f_code forw_vert_f_code search_width/height */ 

113 3 /* Bl: forw hor f code forw vert f code search width/height */ 

117 7 /* Bl: back_hor_f_code back vert f code search width/height */ 

117 7 /* B2: forw_hor_f_code forw vert f code search width/height */ 


113 3 /* B2: backhorfcode back_vert_f_code search width/height *1 


/* config.h, configuration defines */ 

/* define NON_ANSI_COMPILER for compilers without function prototyping */ 
/* #define NON_ANSI_COMPILER V 

#ifdef NONAN SI_COMPILER 
^define _ANSI_ARGS_(x) () 

#else 

#define _ANSI_ARGS_(x) x 
#endif 


/* global h, global variables, function prototypes 


*/ 


/* choose between declaration (GLOBAL undefined) 

* and definition (GLOBAL defined) 

* GLOBAL is defined in exactly one file (mpeg2enc.c) 

*/ 

#include "c:\nick\mpeg\org\mpeg2enc.h" 

#ifndef GLOBAL 
#define EXTERN extern 
#else 

#define EXTERN 
#endif 

/* prototypes of global functions V 

void range_checks _ANSI_ARGS_((void)); 
void profile_and_level_checks _ANSI_ARGS_(()); 

void initfdct _ANSI_ARGS_((void)); 
void fdct _ANSI_ARGS_((short *block)); 

void idct _ANSl_ARGS_((short *block)); 
void init idct _ANSI_ARGS_((void)); 

void motion_estimation _ANSI_ARGS_((unsigned char *oldorg, 
unsigned char *neworg, 
unsigned char *oldref, 
unsigned char *newref, 
unsigned char *cur. 
int sxf, 
int syf, 
int sxb, 
int syb, 

struct mbinfo *mbi)); 

void error _ANSI_ARGS_((char *text)); 

void predict _ANSI_ARGS_((unsigned char *reffl), 
unsigned char *refb[], 
unsigned char *cur[3], 
int secondfield, struct mbinfo *mbi)); 

void initbits _ANSI_ARGS_((void)); 
void putbits _ANSI_ARGS_(( int val, int n)); 
void alignbits _ANSI_ARGS_((void)); 
int bitcount _ANSI_ARGS_((void)), 

void putseqhdr _ANSI_ARGS_((void)); 

void putseqext _ANSl_ARGS_((void)); 

void putseqdispext _ANSI_ARGS_((void)); 

void putuserdata _ANSI_ARGS_((char *userdata)); 

void putgophdr _ANSI_ARGS_((int frame, int closed_gop)); 


void putpiclhdr _ANSI_ARGS_((void)); 
void putpictcodext _ANSI_ARGS_((void)); 
void putseqend _ANSI_ARGS_((void)); 

void putintrablk _ANSI_ARGS_((short *blk, int cc)); 
void putnonintrablk _ANSI_ARGS_((short *blk)); 
void putmv _ANSI_ARGS_((int dmv, int f_code)); 

void putpict _ANSI_ARGS_((unsigned char *frame)); 

void putseq _ANSI_ARGS_((void)); 

void putDClum _ANSI_ARGS_((int val)); 

void putDCchrom _ANSI_ARGS_((int val)); 

void putACfirst _ANSI_ARGS_((int run, int val)); 

void putAC _ANSI_ARGS_((int run, int signedjevel, int vlcformat)); 

void putaddnnc _ANSI_ARGS_((int addrinc)); 

void putmbtype _ANSI_ARGS_((int pict_type, int mb_type)); 

void putmotioncode _ANSI_ARGS_((int motion_code)); 

void putdmv _ANSI_ARGS_((int dmv)); 

void putcbp _ANSI_ARGS_(( int cbp)); 

int quant_intra _ANSI_ARGS_((short *src, short *dst, int dc_prec, 
unsigned char *quant_mat, int mquant)); 
int quant_non_intra _ANSI_ARGS_((short *src, short *dst, 
unsigned char *quant_mat, int mquant)); 
void iquant_intra _ANSI_ARGS_((short *src, short *dst, int dc_prec, 
unsigned char *quant_mat, int mquant)); 
void iquant_non_intra _ANSI_ARGS_((short *src, short *dst, 
unsigned char *quant_mat, int mquant)); 

void rc_init_seq _ANSI_ARGS_((void)); 

void rc_init_GOP _ANSI_ARGS_((int np, int nb)); 

void rc_init_pict _ANSI_ARGS_((unsigned char *frame)); 

void rc_update_pict _ANSI_ARGS_((void)); 

int rc_start_mb _ANSI_ARGS_((void)); 

int rc_calc_mquant _ANSI_ARGS_((int j)); 

void vbv_end_of_picture _ANSI_ARGS_((void)); 

void calc_vbv_delay _ANSI_ARGS_((void)); 

void readframe _ANSI_ARGS_((char *fname, unsigned char *frame[])); 

void transform _ANSI_ARGS_((unsigned char *pred[], 
unsigned char *cur[], 
struct mbinfo *mbi, 
short blocks[][64], 
char forwardinvers)); 

void itransform _ANSI_ARGS_((unsigned char *pred[], 
unsigned char *cur[], 
struct mbinfo *mbi, 
short blocks[][64])); 

void dct_type_estimation _ANSI_ARGS_((unsigned char *pred, 
unsigned char *cur, 
struct mbinfo *mbi)); 



/* zig-zag scan */ 

EXTERN unsigned char zig_zag_scan[64] 
tfifdef GLOBAL 

{ 

0, 1 ,8, 16,9,2,3, 10, 1 7,24,32,25, 18, 1 1 ,4,5, 
12,19,26,33,40,48,41,34,27,20,13,6,7,14,21,28, 
35,42,49,56,57,50,43,36,29,22,15,23,30,37,44,51, 
58,59,52,45,38,3 1 ,39,46,53,60,6 1 ,54,47,55,62,63 

} 

#endif 


/* alternate scan */ 

EXTERN unsigned char altemate_scan[64] 
#ifdef GLOBAL 

{ 

0,8,16,24,1,9,2,10,17,25,32,40,48,56,57,49, 
41,33,26,18,3,11,4,12,19,27,34,42,50,58,35,43, 
5 1,59,20,28,5, 13,6, 14,2 1,29,36,44,52,60,37,45, 
53,61,22,30,7,15,23,31,38,46,54,62,39,47,55,63 

} 

#endif 


/* default intra quantization matrix */ 

EXTERN unsigned char default_intra_quantizer_matrix[64] 
#ifdef GLOBAL 

{ 

8, 16, 19, 22, 26, 27, 29, 34, 

16, 16, 22, 24, 27, 29, 34, 37, 

19, 22, 26, 27, 29, 34, 34, 38, 

22, 22, 26, 27, 29, 34, 37, 40, 

22, 26, 27, 29, 32, 35, 40, 48, 

26, 27, 29, 32, 35, 40, 48, 58, 

26, 27, 29, 34, 38, 46, 56, 69, 

27, 29, 35, 38, 46, 56, 69, 83 

} 

#endif 


/* non-linear quantization coefficient table */ 
EXTERN unsigned char non_linear_mquant_table[32] 
#ifdef GLOBAL 

{ 

0, 1, 2, 3, 4, 5, 6, 7, 

8,10,12,14,16,18,20,22, 

24,28,32,36,40,44,48,52, 

56,64,72,80,88,96,104,112 

} 


#endif 


/* non-linear mquant table for mapping from scale to code 

* since reconstruction levels are not bijective with the index map, 

* it is up to the designer to determine most of the quantization levels 
*/ 


EXTERN unsigned char map_non_linear_mquant[ 1 13] 
#ifdef GLOBAL 


{ 

0,1,2,3,4,5,6,7,8,8,9,9,10,10,11,11,12,12,13,13,14,14,15,15,16,16, 
16,17,17,17,18,18,18,18,19,19,19,19,20,20,20,20,21,21,21,21,22,22, 
22,22,23,23,23,23,24,24,24,24,24,24,24,25,25,25,25,25,25,25,26,26, 
26,26,26,26,26,26,27,27,27,27,27,27,27,27,28,28,28,28,28,28,28,29, 
29,29,29,29,29,29,29,29,29,30,30,30,30,30,30,30,31,3 1,3 1,3 1,3 1 
} 

#endif 


/* picture data arrays */ 

/* reconstructed frames */ 

EXTERN unsigned char *newrefframe[3], *oldreflframe[3], *auxframe[3]; 

/* original frames */ 

EXTERN unsigned char *neworgframe[3], *oldorgframe[3], *auxorgframe[3]; 
/* prediction of current frame */ 

EXTERN unsigned char *predframe[3]; 

/* 8*8 block data */ 

EXTERN short (*blocks)[64]; 

/* intra / non_intra quantization matrices */ 

EXTERN unsigned char intra_q[64], inter_q[64]; 

EXTERN unsigned char chrom_intra_q[64],chromJnter_q[64]; 

/* prediction values for DCT coefficient (0,0) */ 

EXTERN int dc_dct_pred[3]; 

/* macroblock side information array */ 

EXTERN struct mbinfo *mbinfo; 

/* motion estimation parameters */ 

EXTERN struct motion data *motion_data; 

/* clipping (^saturation) table */ 

EXTERN unsigned char *clp; 

/* name strings */ 

EXTERN char id_string[256], tplorg[256], tplrefI256]; 

EXTERN char iqname[256], niqname[256]; 

EXTERN char statname[256]; 

EXTERN char errortext[256]; 

EXTERN FILE *outfile, *statfile; /* file descriptors */ 

EXTERN int inputtype; /* format of input frames */ 

EXTERN int quiet; /* suppress warnings */ 


/* coding model parameters */ 


EXTERN tnt N, /* number of frames in Group of Pictures */ 

EXTERN int M; /* distance between I/P frames */ 

EXTERN int P; /* intra slice refresh interval */ 

EXTERN int nframes; /* total number of frames to encode */ 

EXTERN int firameO, tcO; /* number and timecode of first frame */ 

EXTERN int mpegl; /* ISO/IEC IS 1 1 172-2 sequence */ 

EXTERN int fieldpic; /* use field pictures */ ■ 

/* sequence specific data (sequence header) */ 

EXTERN int horizontal_size, vertical_size; /* frame size (pels) */ 

EXTERN int width, height; /* encoded frame size (pels) multiples of 16 or 32 */ 
EXTERN int chrom_width,chrom_height,block_count; 

EXTERN int mb_width, mb_height, /* frame size (macroblocks) */ 

EXTERN int width2, height2, mb_height2, chrom_width2; /* picture size */ 
EXTERN int aspectratio; /* aspect ratio information (pel or display) */ 
EXTERN int frame_rate_code; /* coded value of frame rate */ 

EXTERN double framerate; /* frames per second */ 

EXTERN double bit_rate; /* bits per second */ 

EXTERN unsigned int vbvbuffersize; /* size of VBV buffer (* 16 kbit) */ 
EXTERN int constrparms; /* constrained parameters flag (MPEG-1 only) */ 
EXTERN int load iquant, load_niquant; /* use non-default quant, matrices */ 
EXTERN int load_ciquant,load_cniquant; 


/* sequence specific data (sequence extension) */ 

EXTERN int profile, level; /* syntax / parameter constraints */ 
EXTERN int prog_seq; /* progressive sequence */ 

EXTERN int chroma_format; 

EXTERN int low delay; /* no B pictures, skipped pictures */ 


/* sequence specific data (sequence display extension) */ 

EXTERN int video_format; /* component, PAL, NTSC, SECAM or MAC */ 
EXTERN int color_primaries; /* source primary chromaticity coordinates */ 
EXTERN int transfer characteristics; /* opto-electronic transfer char, (gamma) */ 
EXTERN int matrix coefficients; /* Eg,Eb,Er / Y,Cb,Cr matrix coefficients */ 
EXTERN int display_horizontal_size, display_vertical_size; /* display size */ 


/* picture specific data (picture header) */ 

EXTERN int temp ref; /* temporal reference */ 

EXTERN int pict type; /* picture coding type (I, P or B) */ 

EXTERN int vbv_delay; /* video buffering verifier delay (1/90000 seconds) */ 


/* picture specific data (picture coding extension) *1 


EXTERN int forwhorfcode, forvv_vert_f_code; 

EXTERN int backhorfcode, back_vert_f_code; /* motion vector ranges */ 
EXTERN int dc_prec; /* DC coefficient precision for mtra coded blocks */ 
EXTERN int pict_stmct; /* picture structure (frame, top / bottom field) */ 
EXTERN int topfirst; /* display top field first */ 

/* use only frame prediction and frame DCT (I, P,B, current) */ 

EXTERN int frame_pred_dct_tab[3], frame_pred_dct; 

EXTERN int conceal_tab[3]; /* use concealment motion vectors (I,P,B) */ 
EXTERN int qscale_tab[3], q_scale_type; /* linear/non-linear quantizaton table */ 
EXTERN int intravlc_tab[3], intravlc; /* intra vie format (I,P,B, current) */ 
EXTERN int altscan_tab[3], altscan; /* alternate scan (I, P,B, current) */ 

EXTERN int repeatfirst; /* repeat first field after second field */ 

EXTERN int prog_frame; /* progressive frame */ 


/* mpg2enc.h, defines and types 
^include <time.h> 


*/ 


#define PICTURE_START_CODE OxlOOL 
#define SLICE MIN ST ART OxlOlL 
#define SLICE MAX ST ART OxlAFL 
#define U SERST ARTCODE OxlB2L 
#defme SEQ_START_CODE 0xlB3L 
#define EXTST ARTCODE Ox 1 B5L 
#define SEQ_END_CODE 0xlB7L 
#define GOP_START_CODE Ox 1 B8L 
#define ISO_END_CODE Ox 1 B9L 

tfdefine P ACK ST ART CODE OxlBAL 
#define S Y STEM ST ART CODE OxlBBL 

/* picture coding type */ 

#define I_TYPE 1 
#define P_TYPE 2 
#define B_TYPE 3 
#define D_TYPE 4 


/* picture structure */ 

#define TOP_FIELD 1 
#define BOTTOM FIELD 2 
#define FRAME PICTURE 3 

/* macroblock type */ 

^define MB_INTRA ] 
#define MB PATTERN 2 
#define MBBACKWARD 4 
#define MB FORWARD 8 
#define MB_QUANT 16 

/* motion_type */ 

#define MC_FIELD 1 
#define MC_FRAME 2 
#defme MC16X8 2 
#define MC_DMV 3 

/* mv format */ 

#defme MVFBELD 0 
#define MVFRAME 1 

/* chroma format */ 

#define CHROMA420 1 
#define CHROMA422 2 
#define CHROMA444 3 


/* extension start code IDs */ 


#define SEQ_ID 1 

#define DISP_ID 2 

#define QUANT_ID 3 


#define SEQSCAL_ID 5 
tfdefine PANSCANJD 7 
#defme CODING_ID 8 
^define SPATSCALJD 9 
#define TEMPSCALID 10 

/* inputtype */ 

#defme T_Y_U_V 0 
^define TYUV 1 
#defme T_PPM 2 

/* macroblock information */ 
struct mbinfo { 

int mbtype; /* intra/forward/backward/interpolated */ 

int motiontype; /* frame/field/16x8/dual_prime */ 

int dct_type; /* field/frame DCT */ 

int mquant; /* quantization parameter */ 

unsigned int cbp; I* coded block pattern */ 

int skipped; /* skipped macroblock */ 

int MV[2][2][2]; I* motion vectors *1 

int mv_field_sel[2][2]; /* motion vertical field select */ 

int dmvector[2]; /* dual prime vectors */ 

double act; I* activity measure */ 

float var; /* for debugging */ 

}; 


/* motion data */ 
struct motion_data { 

int forw_hor_f_code,forw_vert_f_code; /* vector range */ 
int sxf.syf; /* search range *1 
int back_hor_f_code,back_vert_f_code; 
int sxb.syb; 

}; 


/* vlc.h, variable length code tables (used by routines in putvlc.c) */ 

/* type definitions for variable length code table entries */ 

typedef struct 

{ 

unsigned char code; /* right justified */ 
char len; 

} VLCtable; 

/* for codes longer than 8 bits (excluding leading zeroes) */ 
typedef struct 
{ 

unsigned short code; /* right justified */ 
char len; 

} sVLCtable; 


/* data from 1SO/IEC 13818-2 DIS, Annex B, variable length code tables */ 

/* Table B-l, variable length codes for macroblockaddressincrement 
* 

* indexed by [macroblock_address_increment-l] 

* 'macroblockescape' is treated elsewhere 

*/ 

static VLCtable addrinctab[33]= 

{ 

{0x01,1}, {0x03,3}, {0x02,3}, {0x03,4}, 

{0x02,4}, {0x03,5}, {0x02,5}, {0x07,7}, 

{0x06,7}, {0x0b,8}, {0x0a,8}, {0x09,8}, 

{0x08,8}, {0x07,8}, {0x06,8}, {0x17,10}, 

{0x16,10}, {0x15,10}, {0x14,10}, {0x13,10}, 

{0x12,10}, {0x23,11}, {0x22,11}, {0x21,11}, 

{0x20,11}, {Oxlf.ll}, {Oxle,ll}, {Oxld,ll}, 

{Oxlc,ll}, {Oxlb,ll}, {Oxla,ll}, {0x19,11}, 

{0x18,11} 

}; 


/* Table B-2, B-3, B-4 variable length codes for macroblocktype 
* 

* indexed by [macroblock type] 

*/ 

static VLCtable mbtypetab[3][32]= 

{ 

1 * 1*1 

{ 

{0,0}, {1,1}, {0,0}, {0,0}, {0,0}, {0,0}, {0,0}, {0,0}, 

{0,0}, {0,0}, {0,0}, {0,0}, {0,0}, {0,0}, {0,0}, {0,0}, 

{0,0}, {1,2}, {0,0}, {0,0}, {0,0}, {0,0}, {0,0}, {0,0}, 

{0,0}, {0,0}, {0,0}, {0,0}, {0,0}, {0,0}, {0,0}, {0,0} 

}, 


/*?*/ 

{ 

{0,0}, {3,5}, {1,2}, {0,0}, {0,0}, {0,0}, {0,0}, {0,0}, 
{1,3}, {0,0}, {1,1}, {0,0}, {0,0}, {0,0}, {0,0}, {0,0}, 
{0,0}, {1,6}, {1,5}, {0,0}, {0,0}, {0,0}, {0,0}, {0,0}, 
{0,0}, {0,0}, {2,5}, {0,0}, {0,0}, {0,0}, {0,0}, {0,0} 

/* B */ 

{ 

{0,0}, {3,5}, {0,0}, {0,0}, {2,3}, {0,0}, {3,3}, {0,0}, 
{2,4}, {0,0}, {3,4}, {0,0}, {2,2}, {0,0}, {3,2}, {0,0}, 
{ 0 , 0 }, { 1 , 6 }, { 0 , 0 }, { 0 , 0 }, { 0 , 0 }, { 0 , 0 }, { 2 , 6 }, { 0 , 0 }, 
{0,0}, {0,0}, {3,6}, {0,0}, {0,0}, {0,0}, {2,5}, {0,0} 

} 

}; 


/* Table B-5 ... B-8 variable length codes for macroblocktype in 

* scalable sequences 

* 

* not implemented 
*/ 

/* Table B-9, variable length codes for coded_block_pattem 

* 

* indexed by [coded_block_pattem] 

♦/ 

static VLCtable cbptable[64]= 

{ 

{0x01,9}, {0x0b,5}, {0x09,5}, {0x0d,6}, 

{0x0d,4}, {0x17,7}, {0x13,7}, {0xlf,8}, 

{0x0c,4}, {0x16,7}, {0x12,7}, {0xle,8}, 

{0x13,5}, {0xlb,8}, {0x17,8}, {0x13,8}, 

{0x0b,4}, {0x15,7}, {0x11,7}, {0xld,8}, 

{0x11,5}, {0x19,8}, {0x15,8}, {0x11,8}, 

{0x0f,6}, {0x0f,8}, {0x0d,8}, {0x03,9}, 

{0x0f,5}, {0x0b,8}, {0x07,8}, {0x07,9}, 

{0x0a,4}, {0x14,7}, {0x10,7}, {0xlc,8}, 

{0x0e,6}, {0x0e,8}, {0x0c,8}, {0x02,9}, 

{0x10,5}, {0x18,8}, {0x14,8}, {0x10,8}, 

{0x0e,5}, {0x0a,8}, {0x06,8}, {0x06,9}, 

{0x12,5}, {0xla,8}, {0x16,8}, {0x12,8}, 

{0x0d,5}, {0x09,8}, {0x05,8}, {0x05,9}, 

{0x0c,5}, {0x08,8}, {0x04,8}, {0x04,9}, 

{0x07,3}, {0x0a,5}, {0x08,5}, {0x0c,6} 

}; 


/* Table B-10, variable length codes for motioncode 

* 

* indexed by [abs(motion_code)l 

* sign of motion code is.treated elsewhere 

*/ 


static VLCtable motionvectab[17]= 

{ 

{0x01,1}, {0x01,2}, {0x01,3}, {0x01,4}, 
{0x03,6}, {0x05,7}, {0x04,7}, {0x03,7}, 
{0x0b,9}, {0x0a,9}, {0x09,9}, {0x11,10}, 
{0x10,10}, {0x0f,10}, {OxOe.lO}, {0x0d,10}, 
{OxOc.lO} 

}; 


/* Table B-l 1, variable length codes for dmvector 
* 

* treated elsewhere 
*1 

/* Table B-l 2, variable length codes for dct_dc_size_luminance 

* 

* indexed by [dct_dc_size_luminance] 

*/ 

static sVLCtable DClumtab[12]= 

{ 

{0x0004,3}, {0x0000,2}, {0x0001,2}, {0x0005,3}, {0x0006,3}, {0x000e,4}, 
{0x001e,5}, {0x003e,6}, {0x007e,7}, {0x00fe,8}, {0x01fe,9}, {0x01ff,9} 

}; 


/* Table B-l 3, variable length codes for dct_dc_size_chrominance 

* 

* indexed by [dct_dc_size_chrominance] 

*/ 

static sVLCtable DCchromtab[12]= 

{ 

{0x0000,2}, {0x0001,2}, {0x0002,2}, {0x0006,3}, {0x000e,4}, {0x001e,5}, 
{0x003e,6}, {0x007e,7}, {0x00fe,8}, {0x01fe,9}, {0x03fe,10},{0x03fif,10} 

}; 


/* Table B-14, DCT coefficients table zero 

* 

* indexed by [run] [level- 1] 

* split into two tables (dct_code_tabl, dct_code_tab2) to reduce size 

* 'first DCT coefficient' condition and 'End of Block' are treated elsewhere 

* codes do not include s (sign bit) 

*/ 


static VLCtable dct_code_tabl[2][40]= 

{ 

/* run = 0, level = 1...40 */ 

{ 

{0x03, 2}, {0x04, 4}, {0x05, 5}, {0x06, 7}, 
{0x26, 8}, {0x21, 8}, {0x0a,10}, {0xld,12}, 


{0x18,12}, {0x13,12}, {0x10,12}, {0xla,13}, 
{0x19,13}, {0x18,13}, {0x17,13}, {0xlf,14}, 
{0xle,14}, {0xld,14}, {0xlc,14}, {0xlb,14}, 
{0xla,14}, {0x19,14}, {0x18,14}, {0x17,14}, 
{0x16,14}, {0x15,14}, {0x14,14}, {0x13,14}, 
{0x12,14}, {0x11,14}, {0x10,14}, {0x18,15}, 
{0x17,15}, {0x16,15}, {0x15,15}. {0x14,15}, 
{0x13,15}, {0x12,15}, {0x11,15}, {0x10,15} 
}, 

/*run= 1, level = 1... 18*/ 

{ 

{0x03, 3}, {0x06, 6}, {0x25, 8}, {0x0c,10}, 
{0xlb,12}, {0x16,13}, {0x15,13}, {0xlf,15}, 
{0xle,15}, {0xld,15}, {0xlc,15}, {0xlb,15}, 
{0xla,15}, {0x19,15}, {0x13,16}, {0x12,16}, 
{0x1 1,16}, {0x10,16}, {0x00, 0}, {0x00, 0}, 
{0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}, 
{0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}, 
{0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}, 
{0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}, 
{0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0} 

} 

}; 


static VLCtable dct_code_tab2[30][5]= 

{ 

/* run = 2 ...31, level = 1...5 */ 

{{0x05, 4}, {0x04, 7}, {0x0b,10}, {0x14,12}, {0x14,13}}, 
{{0x07, 5}, {0x24, 8}, {0xlc,12}, {0x13,13}, {0x00, 0}}, 
{{0x06, 5}, {0x0f,10}, {0x12,12}, {0x00, 0}, {0x00, 0}}, 
{{0x07, 6}, {0x09,10}, {0x12,13}, {0x00, 0}, {0x00, 0}}, 
{{0x05, 6}, {0xle,12}, {0x14,16}, {0x00, 0}, {0x00, 0}}, 
{{0x04, 6}, {0x15,12}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x07, 7}, {0x11,12}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x05, 7}, {0x11,13}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x27, 8}, {0x10,13}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x23, 8}, {0xla,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x22, 8}, {0x19,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x20, 8}, {0x18,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{OxOe.lO}, {0x17,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x0d,10}, {0x16,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x08,10}, {0x15,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xlf,12}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{Oxla,12}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x19,12}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x17,12}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x16,12}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{Oxlf,13}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{Oxle,13}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{Oxld,13}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{Oxlc,13}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{Oxlb,13}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xlf,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xle,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 


{{Oxld, 16}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xlc,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xlb,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}} 
}; 


/* Table B-15, DCT coefficients table one 

* 

* indexed by [run] [level- 1] 

* split into two tables (dct_code_tabla, dct_code_tab2a) to reduce size 

* 'End of Block' is treated elsewhere 

* codes do not include s (sign bit) 

*/ 


static VLCtable dct_code_tabla[2][40]= 

{ 

/* run = 0, level = 1 . ..40 */ 

{ 

{0x02, 2}, {0x06, 3}, {0x07, 4}, {Oxlc, 5}, 
{Oxld, 5}, {0x05, 6}, {0x04, 6}, {0x7b, 7), 
{0x7c, 7}, {0x23, 8}, {0x22, 8}, {Oxfa, 8}, 
{Oxfb, 8}, {Oxfe, 8}, {Oxff, 8}, {0xlf,14}, 
{0xle,14}, {Oxld, 14}, {Oxlc, 14}, {0xlb,14}, 
{0xla,14}, {0x19,14}, {0x18,14}, {0x17,14}, 
{0x16,14}, {0x15,14}, {0x14,14}, {0x13,14}, 
{0x12,14}, {0x11,14}, {0x10,14}, {0x18,15}, 
{0x17,15}, {0x16,15}, {0x15,15}, {0x14,15}, 
{0x13,15}, {0x12,15}, {0x11,15}, {0x10,15} 
}, 

/* run = 1, level = 1.. .18 */ 

{ 

{0x02, 3}, {0x06, 5}, {0x79, 7}, {0x27, 8}, 
{0x20, 8}, {0x16,13}, {0x15,13}, {0xlf,15}, 
{0xle,15}, {Oxld, 15}, {0xlc,15}, {0xlb,15}, 
{0xla,15}, {0x19,15}, {0x13,16}, {0x12,16}, 
{0x11,16}, {0x10,16}, {0x00,0}, {0x00,0}, 
{0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}, 
{0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}, 
{0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}, 
{0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}, 
{0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0} 

} 

}; 


static VLCtable dct_code_tab2a[30][5]= 

{ 

/* run = 2. ..31, level = 1...5 */ 

{{0x05, 5}, {0x07, 7}, {Oxfc, 8}, {0x0c,10}, {0x14,13}}, 
{{0x07, 5}, {0x26, 8}, {Oxlc, 12}, {0x13,13}, {0x00, 0}}, 
{{0x06, 6}, {Oxfd, 8}, {0x12,12}, {0x00, 0}, {0x00, 0}}, 
{{0x07, 6}, {0x04, 9}, {0x12,13}, {0x00, 0}, {0x00, 0}}, 
{{0x06, 7}, {0xle,12}, {0x14,16}, {0x00, 0}, {0x00, 0}}, 
{{0x04, 7}, {0x15,12}, .{0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x05, 7}, {0x11,12}, {0x00,0}, {0x00,0}, {0x00,0}}, 


{{0x78, 7}, {0x11,13}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x7a, 7}, {0x10,13}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x21, 8}, {0xla,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x25, 8}, {0x19,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x24, 8}, {0x18,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x05, 9}, {0x17,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x07, 9}, {0x16,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x0d,10}, {0x15,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xlf,12}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xla,12}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x19,12}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x17,12}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0x16,12}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xlf,13}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xle,13}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xld,13}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xlc,13}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xlb,13}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 

{ {Oxlf, 16}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xle,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xld,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xlc,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}}, 
{{0xlb,16}, {0x00, 0}, {0x00, 0}, {0x00, 0}, {0x00, 0}} 

}; 


constl.h 


/* 


*/ 


#define WORKSPACE 2048 
#define END 0 


/*********************************/ 

/* */ 

/* encoder.cpp */ 

/* */ 

/*********************************/ 

#include <stdio.h> 

^include <stdlib.h> 

#define GLOBAL /* used by global.h */ . 

#include "c:\nick\mpeg\org\config.h" 

#include "c:\nick\mpeg\org\global.h" 

/* private prototypes */ 

static void init _ANSI_ARGS_((void)); 

static void readparmfile _ANSI_ARGS_((char *fname)); 

static void readquantmat _ANSJ_ARGS_((void)); 


void main(int argc, char *argv[] ) 

{ 

if (argc! =3) 

{ 

printf("Usage: mpeg2encode in.par out.m2v\n"); 
exit(O); 

} 

/* read parameter file */ 
readparmfile(argv[ 1 ]); 

/* read quantization matrices */ 
readquantmatf); 

/* open output file */ 
if (!(outfile=fopen(argv[2],"wb"))) 

{ 

sprintf(errortext, "Couldn't create output file %s",argv[2]); 
error(errortext); 

} 

init(); 

putseqO; 

fclose(outfile); 


static void init() 

{ 

int i, size; 

static int block_count_tab[3] = {6,8,12}; 

initbitsO; 

init_fdct(); 


init_idct(); 

/* round picture dimensions to nearest multiple of 16 or 32 */ 
mbvvidth = (horizontal_size+15)/16; 

mb_height = prog_seq ? (vertical_size+15)/16 : 2*((vertical_size+3 1 )/32); 
mb_height2 = fieldpic ? mb_height»l : mb_height, /* for field pictures */ 
width = 16*mb_width; 
height = 16*mb_height; 

chromwidth = (chroma_format=CHROMA444) ? width : width»l; 
chrom_height = (chroma_format!=CHROMA420) ? height : height»l; 

height2 = fieldpic ? height»l : height; 

width2 = fieldpic ? width«l : width; 

chrom_width2 = fieldpic ? chrom_width«l : chrom width; 

blockcount = block_count_tab[chroma_format-l], 

I* clip table */ 

if (!(clp = (unsigned char *)malloc(1024))) 
error("malloc failed\n"); 
clp+= 384; 

for (i=-384; i<640; i++) 
clp[i] = (i<0) ? 0 : ((i>255) ? 255 : i), 

for (i=0; i<3; i++) 

{ 

size = (i=0) ? width*height : chrom_width*chrom_height; 

if (!(newrefframe[i] = (unsigned char *)malloc(size))) 
error("malloc failed\n"), 

if (!(oldrefframe[i] = (unsigned char *)malloc(size))) 
error(''malloc failed\n"); 

if (!(auxframe[i] = (unsigned char *)malloc(size))) 
error("malloc failed\n"); 

if (!(neworgframe[i] = (unsigned char *)malloc(size))) 
error("malloc failedVn"); 

if (!(oldorgfirame[i] = (unsigned char *)malloc(size))) 
error("malloc failed\n"); 

if (!(auxorgframe[i] = (unsigned char *)malloc(size))) 
error("malloc failed\n”); 

if (!(predframe[i] = (unsigned char *)malloc(size))) 
error("malloc failed\n"); 

} 

mbinfo = (struct mbinfo *)malloc(mb_width*mb_height2*sizeof(struct mbinfo)); 
if (Imbinfo) error("malloc failed\n"), 
blocks = 

(short (*)[64])malloc(mb_width*mb_height2*block_count*sizeof(short [64])); 


if (Iblocks) error("malloc failed\n"); 


} 


void error(char *te.\t) 

{ 

fprintf(stderr,text); 

putc('\n',stderr); 

exit(l); 

} 

static void readparmfile(char *fname) 

{ 

int i; 

int h,m,s,f; 

FILE *fd; 

char line[256]; 

static double ratetab[8]= 

{24000.0/1001.0,24.0,25.0,30000.0/1001.0,30.0,50.0,60000.0/1001.0,60.0}; 
extern int r,Xi,Xb,Xp,dOi,dOp,dOb; /* rate control */ 
extern double avg_act; /* rate control *1 

if (!(fd = fopen(fhame,"r"))) 

{ 

sprintfierrortext, "Couldn't open parameter file %s",fhame); 
error(errortext); 

} 

fgets(id_string,254,fd); 
fgets(line,254,fd); sscanf(line,''%s",tplorg); 
fgets(line,254,fd); sscanf(line,"%s",tplref); 
fgets(line,254,fd); sscanf(line,"%s",iqname); 
fgets(line,254,fd); sscanf(line,"%s",niqname); 
fgets(line,254,fd); sscanf(line,"%s",statname); 
fgets(line,254,fd); sscanf(line,"%d",&inputtype); 
fgets(line,254,fd); sscanf(line,"%d",&nframes); 
fgets(line,254,fd); sscanf(line,"%d",&frameO); 
fgets(line,254,fd); sscanf(line,"%d:%d:%d:%d",&h,&m,&s,&f); 
fgets(line,254,fd); sscanf(line,"%d",&N); 
fgets(line,254,fd); sscanf(line,"%d",&M); 
fgets(line,254,fd); sscanf(line,"%d”,&mpegl); 
fgets(line,254,fd); sscanf(line,"%d",&fieldpic); 
fgets(line,254,fd); sscanf(line,"%d",&horizontal_size), 
fgets(line,254,fd); sscanf(line,''%d",&vertical_size); 
fgets(line,254,fd); sscanf(line,"%d",&aspectratio); 
fgets(line,254,fd); sscanf(line,"%d",&frame_rate_code); 
fgets(line,254,fd); sscanf(line,"%lf',&bit_rate); 
fgets(line,254,fd); sscanf(line,''%d",&vbv_buffer_size); 
fgets(line,254,fd); sscanf(line,"%d",&low_delay); 
fgets(line,254,fd); sscanf(line,"%d",&constrparms); 
fgets(line,254,fd); sscanf(line,"%d",&profile); 
fgets(line,254,fd); sscanf(line,"%d",&level); 
fgets(line,254,fd); sscanf(line,''%d",&prog_seq); 
fgets(line,254,fd); sscanf(line,"%d",&chroma_format); 
fgets(line,254,fd); sscanf(line,"%d",&video_format); 
fgets(line,254,fd); sscanf(line,"%d",&color_primaries); 


fgets(line,254,fd); sscanf(line,"%d",&transfer_characteristics); 
fgets(line,254,fd); sscanf(line,"%d",&matrix_coefficients); 
fgets(line,254,fd); sscanf(lme,"%d",&display_horizontal_size); 
fgets(line,254,fd); sscanf(line,"%d",&display_vertical_size); 
fgets(line,254,fd); sscanf(line,"%d",&dc_prec); 
fgets(line,254,fd); sscanf(line,"%d",&topfirst), 
fgets(line,254,fd); sscanf(line,”%d %d %d", 
frame_pred_dct_tab,frame_pred_dct_tab+ 1 ,frame_pred_dct_tab+2), 

fgets(line,254,fd); sscanf(line, H %d %d %d", 
conceal_tab,conceal_tab+l ,conceal_tab+2); 

fgets(line,254,fd); sscanf(line,"%d %d %d", 
qscale_tab,qscale_tab+l,qscale_tab+2); 

fgets(line,254,fd); sscanf(line,"%d %d %d", 
intravlc_tab,intravlc_tab+ 1 .intravlc _tab+2); 
fgets(hne,254,fd); sscanf(line,"%d %d %d", 
altscan_tab,altscan_tab+ 1 ,altscan_tab+2); 
fgets(line,254,fd); sscanf(line,''%d",&repeatfirst); 
fgets(line,254,fd); sscanf(line,''%d",&prog_frame); 

/* intra slice interval refresh period */ 
fgets(line,254,fd); sscanf(line,"%d",&P); 
fgets(line,254,fd); sscanf(line,"%d",&r); 
fgets(line,254,fd); sscanf(line, n %lf',&avg_act); 
fgets(line,254,fd); sscanf(line,"%d",&Xi); 
fgets(line,254,fd); sscanf(line,"%d",&Xp); 
fgets(line,254,fd); sscanf(line,"%d",&Xb); 
fgets(line,254,fd); sscanf(line,"%d",&dOi); 
fgets(line,254,fd); sscanf(line,"%d",&dOp); 
fgets(line,254,fd); sscanf(line,"%d",&dOb); 

if (N<1) error("N must be positive"); 

if (M<1) error("M must be positive"); 

if (N%M != 0) error("N must be an integer multiple of M"); 

motion data = (struct motion data *)malloc(M*sizeof(struct motion data)); 
if (!motion_data) error("malloc failed\n"); 

for (i=0; i<M; i++) 

{ 

fgets(line,254,fd); 
sscanf(line,"%d %d %d %d", 

&motion_data[i].forw_hor_f_code, &motion_data[i].for\v_vert_f_code, 
&motion_data[i].sxf, &motion_data[i].syf), 

if (i!=0) 

{ 

fgets(line,254,fd); 
sscanf(line,"%d %d %d %d", 

&motion_data[i] .backhorfcode, &motion_data[i] backvertfcode, 
&motion_data[i].sxb, &motion_data[i].syb); 

} 


fclose(fd); 

/* make flags boolean (x!=0 -> x=l) */ 

mpegl = Mmpegl; 

fieldpic = Mfieldpic; 

lowdelay = Mlowdelay; 

constrparms = ! Iconstrparms; 

prog_seq = !!prog_seq; 

topfirst = Htopfirst; 


for (i=0; i<3; i++) 

{ 

frame _pred_dct_tab[i] = !!frame_pred_dct_tab[i]; 
conceal_tab[i] = !!conceal_tab[i]; 
qscale_tabli] = !!qscale_tab[i]; 
intravlc_tab[i] = !!intravlc_tab[i]; 
altscan_tab[i] = !!altscan_tab[i]; 

} 

repeatfirst = Hrepeatflrst; 
progframe = ! Iprogframe; 

/* make sure MPEG specific parameters are valid */ 
range_checks(); 

frame_rate = ratetab[frame_rate_code-l]; 

/* timecode -> frame number */ 
tcO = h; 

tcO = 60*tc0 + m; 
tcO = 60*tc0 + s; 

tcO = (int)(frame_rate+0.5)*tc0 + f; 

if (Impegl) 

{ 

profile_and_level_checks(); 

} 

else 

{ 

/* MPEG-1 */ 
if (constrparms) 

{ 

if (horizontal_size>768 
|| vertical_size>576 

|| ((horizontal_size+15)/16)*((vertical_size+15)/16)>396 
|| ((horizontal_size+ 15)/16)*((vertical_size+ 1 5)/ 1 6)* frame_rate>3 96 *25.0 
|| frame_rate>30.0) 

{ 

if (Iquiet) 

fprintf(stderr, "Warning: setting constrained _parameters_flag = 0\n"); 
constrparms = 0; 

} 

> 


if (constrparms) 

{ 

for (i=0; i<M; i++) 

{ 

if (motion_data[i].forw_hor_f_code>4) 

{ 

constrparms = 0; 
break; 

} 

if (motion_data[i].forw_vert_f_code>4) 

{ 

constrparms = 0; 
break; 

} 

if (i!=0) 

{ 

if (motion_data[i].back_hor_f_code>4) 

{ 

constrparms = 0; 
break; 

} 

if (motion_data[i].back_vert_f_code>4) 

{ 

constrparms = 0; 
break; 

} 

} 

} 

} 

} 

/* relational checks */ 

if (Impegl && constrparms) constrparms = 0; 

if (prog_seq && Iprog ffame) prog_frame = 1; 

if (prog_ffame && fieldpic) fieldpic = 0; 

if (!prog_ffame && repeatfirst) repea tfirst = 0; 

if (prog_ffame) 
for (i=0; i<3; i++) 

if (!firame_pred_dct_tab[i]) frame_pred_dct_tab[i] = 1, 

if (prog_seq && .'repeatfirst && topfirst) topfirst = 0; 

/* search windows */ 
for (i=0; i<M; i++) 

{ 

if (motion_data[i].sxf > (4«motion_data[i].forw_hor_f_code)-l) 


motion_data[i].sxf = (4<<motion_data[i].forvv_hor_f_code)-l; 

if (motion_data[i].syf > (4<<motion_data[i].forw_vert_f_code)-l) 
motion_data[i].syf = (4<<motion_data[i].forvv_vert_f_code)-l ; 

if (i!=0) 

{ 

if (motion_data[i].sxb > (4«motion_data[i].back_hor_f_code)-l) 
motion_data[i]sxb = (4<<motion_data[i].back_hor_f_code)-l; 

if (motion_data[i].syb > (4«motion_data[i].back_vert_f_code)-l) 
motion_data[i]syb = (4<<motion_data[i].back_vert_f_code)-l; 

} 

} 


static void readquantmat() 

{ 

int i,v; 

FILE *fd; 

if (iqname[0]— -’) 

{ 

/* use default intra matrix */ 

Ioad_iquant = 0; 
for (i=0; i<64; i++) 

intra_qli] = default_intra_quantizer_matrix[i]; 

} 

else 

{ 

/* read customized intra matrix */ 

load_iquant = 1; 

if (!(fd = fopen(iqname, "r"))) 

{ 

sprintf(errortext, "Couldn't open quant matrix file %s",iqname); 
error(errortext); 

} 

for (i=0; i<64; i++) 

{ 

fscanf(fd,"%d",&v); 
if (v<l || v>255) 

error("invalid value in quant matrix"); 
intra_q[i] = v, 

} 

fclose(fd); 

} 

if (niqname[0]='-’) 

{ 

/* use default non-intra matrix */ 
load_niquant = 0; 


for (i=0, i<64; i++) 
inter_q[i] = 16; 

} 

else 

{ 

/* read customized non-intra matrix */ 

loadniquant = 1; 

if (!(fd = fopen(niqname,"r"))) 

{ 

sprintf(errortext,"Couldn't open quant matrix file %s",niqname); 
error(errortext); 

} 

for (i=0; i<64; i++) 

{ 

fscanf(fd,"%d",&v); 
if (v<l || v>255) 

error("invalid value in quant matrix"); 
inter_q[i] = v; 

} 

fclose(fd); 

} 

} 

/* check for (level independent) parameter limits */ 
void range_checks() 

{ 

int i; 

/* range and value checks •/ 

if (horizontal_size<l || horizontal_size> 16383) 
error("horizontal_size must be between 1 and 16383"); 
if (mpegl && horizontal_size>4095) 
error("horizontal_size must be less than 4096 (MPEG-1)”); 
if ((horizontal_size&4095)=0) 
error("horizontal_size must not be a multiple of 4096"); 
if (chroma format ! =CHROMA444 && horizontal_size%2 != 0) 
error("horizontal_size must be a even (4:2:0 / 4:2.2)"); 

if (vertical_size<l || vertical_size>16383) 
error("vertical_size must be between 1 and 16383"); 
if (mpegl && vertical_size>4095) 
error("vertical size must be less than 4096 (MPEG-1)"); 
if ((vertical_size&4095)=0) 
error("vertical_size must not be a multiple of 4096"); 
if (chroma_format— CHROMA420 && vertical_size%2 != 0) 
error("vertical_size must be a even (4:2:0)"); 
if(fieldpic) 

{ 

if (vertical_size%2 != 0) 

error("vertical_size must be a even (field pictures)"); 
if (chroma_format=CHROMA420 && vertical_size%4 != 0) 


error("vertical_size must be a multiple of 4 (4:2:0 field pictures)"); 

} 

if (mpegl) 

{ 

if (aspectratio<l || aspectratio>14) 

error("pel_aspect_ratio must be between 1 and 14 (MPEG-1)"); 

} 

else 

{ 

if (aspectratio<l || aspectratio>4) 
error("aspect_ratio_information must be 1, 2, 3 or 4"); 

} 

if (frame_rate_code< 1 1| frame_rate_code>8) 
error("frame_rate code must be between 1 and 8"); 

if (bit_rate<=0.0) 
error("bit_rate must be positive"); 
if (bitrate > ((1«30)-1)*400.0) 
error("bit_rate must be less than 429 Gbit/s"); 
if (mpegl && bit_rate > ((1«18)-1)*400.0) 
error("bit_rate must be less than 104 Mbit/s (MPEG-1)"); 

if (vbv_buffer_size<l || vbv_buffer_size>0x3fEBf) 
error("vbv_buffer_size must be in range l..(2 A 18-l)"); 
if (mpegl && vbv_buffer_size>=1024) 
error("vbv_buffer_size must be less than 1024 (MPEG-1)"); 

if (chroma_format<CHROMA420 1| chroma_format>CHROMA444) 
enor("chroma_format must be in range 1...3"); 

if (video_format<0 || video_format>4) 
error("video_format must be in range 0...4"); 

if (color_primaries<l || color_primaries>7 || color_primaries=3) 
error("color_primaries must be in range 1...2 or 4. ..7"); 

if (transfer_characteristics<l || transfer_characteristics>7 
|| transfer_characteristics=3) 

error("transfer_characteristics must be in range 1...2 or 4.. .7"); 

if (matrix_coefficients<l || matrix_coefficients>7 || matrix_coefficients=3 ) 
enor("matrix_coeflicients must be in range 1...2 or 4.. .7"); 

if (display_horizontal_size<0 || display_horizontal_size> 16383) 
error("display_horizontal_size must be in range 0... 16383"); 
if (display_vertical_size<0 || display_vertical_size>16383) 
error("display_vertical_size must be in range 0... 16383"); 

if (dc_prec<0 || dc_prec>3) 
error("intra_dc_precision must be in range 0...3”); 


for (i=0; i<M; i++) 


if (motion_data[i].forw_hor_f_code<l || motion_data[i].forw_hor_f_code>9) 
error("f_code must be between 1 and 9"); 
if (motion_data[i].forw_vert_f_code<l || motion_data[i] forw_vert_f_code>9) 
error("f_code must be between 1 and 9"); 
if (mpegl && motion_data[i].forw_hor_f_code>7) 
error("f_code must be le less than 8"); 
if (mpegl && motion_data[i].forw_vert_f_code>7) 
error("f_code must be le less than 8"); 
if (motion_data[i].sxf<=0) 

error("search window must be positive"); /* doesn't belong here */ 
if (motion_data[i].syf<=0) 
error("search window must be positive"); 
if (i!=0) 

{ 

if (motion_data[i).back_hor_f_code<l || motion_data[i].back_hor_f_code>9) 
error("f_code must be between 1 and 9"); 

if (motion_data[i].back_vert_f_code<l || motion_data[i].back_vert_f_code>9) 
error("f_code must be between 1 and 9"); 
if (mpegl && moUon_data[i].back_hor_f_code>7) 
error("f_code must be le less than 8"); 
if (mpegl &A motion_data[i].back_vert_f_code>7) 
errorffcode must be le less than 8"); 
if (motion_data[i].sxb<=0) 
errorf'search window must be positive"); 
if (motion_data[i].syb<=0) 
error("search window must be positive"); 

} 

} 

} 

/* identifies valid profile / level combinations */ 
static char profile_level_defmed[5][4] = 

{ 

/* HL H-14ML LL */ 

{1, 1, 1, 0), I* HP */ 

{0, 1, 0, 0}, /* Spat V 

{0, 0, 1, 1}, /* SNR */ 

(1, 1, 1, 1}, /* MP */ 

{0, 0, 1, 0} /*SP */ 

}; 

static struct level_limits { 
int horfcode; 
int vertfcode; 
int horsize; 
int vertsize; 
int samplerate; 
int bit_rate; /* Mbit/s */ 
int vbv bufifer size; /* 16384 bit steps */ 

} maxval_tab[4] = 

{ 

{9, 5, 1920, 1152, 62668800, 80, 597}, /♦ HL */ 

(9, 5, 1440, 1152, 47001600, 60, 448}, /* H-14 */ 


{8, 5, 720, 576, 10368000, 15, 112}, /* ML */ 

{7,4, 352, 288, 3041280, 4, 29} /* LL */ 

}; 

#defme SP 5 
#define MP 4 
#define SNR 3 
#defme SPAT 2 
#define HP 1 

#define LL 10 
#defme ML 8 
#defineH14 6 
#defme HL 4 

void profile_and_level_checks() 

{ 

int i; 

struct leveljimits *maxval; 

if (profile<0 || profile>15) 
error("profile must be between 0 and 15”); 

if (level<0 || level>15) 
error("level must be between 0 and 15"); 

if (profile>=8) 

{ 

if (Iquiet) 

fprintf(stderr,"Waming: profile uses a reserved value, conformance checks skipped\n”); 
return; 

} 

if (profile<HP || profile>SP) 
error("undefined Profile”); 

if (profile=SNR || profile=SPAT) 

error(”This encoder currently generates no scalable bitstreams"); 

if (level<HL || level>LL || level&l) 
error("undefined Level"); 

maxval = &maxval_tab[(level-4) » 1]; 

/* check profile@level combination */ 
if(!profile_level_defined[profile-l][(level-4) » 1]) 
error("undefined profile@level combination"); 


/* profile (syntax) constraints */ 

if (profile=SP && M!=l) 
error("Simple Profile does not allow B pictures"); 


if (profile!=HP && chroma_format!=CHROMA420) 
error("chroma format must be 4 2 0 in specified Profile"); 

if (profile=HP && chroma_format=CHROMA444) 
error("chroma format must be 4.2:0 or 4:2:2 in High Profile"), 

if (profile>=MP) /* SP, MP: constrained repeat_first_field */ 

{ 

if (frame_rate_code<=2 && repea tfirst) 
error("repeat_first_first must be zero"); 
if (frame_rate_code<=6 && prog_seq && repea tfirst) 
error("repeat_first_first must be zero"); 

} 

if (profileNHP && dc_prec=3) 
errorf' 1 1 bit DC precision only allowed in High Profile"); 


/♦ level (parameter value) constraints */ 

/* Table 8-8 */ 

if (frame_rate_code>5 && level>=ML) 
error("Picture rate greater than permitted in specified Level"); 

for (i=0; i<M; i++) 

{ 

if (motion_data[i] .forw hor f code > maxval->hor_f_code) 
error("forward horizontal f_code greater than permitted in specified Level"); 

if (motion_data[i].forw_vert_f_code > maxval ->vert_f_code) 
error("forward vertical f code greater than permitted in specified Level”); 

if (i!=0) 

{ 

if (motion_data[i].back_hor_f_code > maxval->hor_f_code) 
error("backward horizontal f code greater than permitted in specified Level"); 

if (motion_data[i].back_vert_f_code > maxval ->vert_f_code) 
error("backward vertical f code greater than permitted in specified Level"); 

} 

} 

/* Table 8-10 */ 

if (horizontal_size > maxval->hor_size) 
error("Horizontal size is greater than permitted in specified Level"); 

if (vertical_size > maxval->vert_size) 

enor("Horizontal size is greater than permitted in specified Level"), 

/♦Table 8-11 ♦/ 

if (horizontal_size*vertical_size*frame_rate > maxval->sample_rate) 
error("Sample rate is greater than permitted in specified Level"); 


/♦ Table 8-12 ♦/ 


if (bit_rate> 1 ,0e6 * maxval->bit_rate) 
error("Bit rate is greater than permitted in specified Level"); 


/* Table 8-13 */ 

if (vbvbuffersize > maxval->vbv_buffer_size) 
error("vbv_buffer_size exceeds High Level limit"); 

} 


/*********************************/ 


/* 


*/ 

/* 

putseql.qjp 

*/ 

/* 


*/ 


/*********************************/ 


#include <stdio.h> 

#include <string.h> 

#include "c:\nick\mpeg\org\configh" 

^include "c:\nick\mpeg\org\globaI.h" 

void putseq() 

{ 

/* this routine assumes (N % M) = 0 */ 
int i, j, k, f, fD, n, np, nb; 
int sxf, syf, sxb, syb; 

FILE *f_d, ♦f status; 

char name[ 128], namel[128]; 

unsigned char *neworg[3], *newref[3]; 

rc_init_seq(); /* initialize rate control */ 

/* sequence header, sequence extension and sequence display extension */ 

putseqhdr(); 

if (Impegl) 

{ 

putseqextO; 

putseqdispext(); 

} 

/* loop through all frames in encoding/decoding order */ 

for (i=0; i<nframes; i++) 

{ 

/* fD: lowest frame number in current GOP 
* 

* first GOP contains N-(M-1) frames, 

* all other GOPs contain N frames 

*/ 

ID = N*((i+(M-1))/N) - (M-l); 

if (fl)<0) 

1D=0; 

if (i==0 || (i-l)%M=0) 

{ 

/* I or P frame */ 
for 
{ 

/* shuffle reference frames */ 
neworg[j] = oldorgframe[j]; 
newrefU] = oldrefframe[j]; 
oldorgframe[j] = neworgframe[j]; 
oldrefframelj] = newrefframe[j]; 


neworgframe[j] = neworg[j]; 
newrefframe[j] = newreflj]; 

> 

/* f. frame number in display order */ 
f=(i=0)?0 : i+M-1; 
if (f>=nframes) 
f = nframes - 1; 

if (i=fO) /* first displayed frame in GOP is I */ 

{ 

/* I frame */ 
pictjype = I_TYPE; 

forw_hor_f_code = forw_vert_fcode =15; 
back_hor_f_code = back_vert_fcode = 15; 

/* n: number of frames in current GOP 
* 

* first GOP contains (M-l) less (B) frames 
*/ 

n = (i=0) ? N-(M-1) : N; 


/* last GOP may contain less frames */ 
if (n > nframes-fD) 
n = nframes-fD; 


/* number of P frames */ 
if (i=0) 

np - (n + 2*(M-1))/M - 1; /* first GOP */ 
else 

np = (n + (M-l))/M - 1; 

/* number of B frames */ 
nb = n - np - 1; 

rc_init_GOP(np,nb); 

putgophdr(fD,i=0); /* set closed GOP in first GOP only */ 

} 

else 

{ 

/* P frame */ 
picttype = P_TYPE; 

forw_hor_f_code = motion_data[0].forw_hor_f_code; 
forw_vert_f_code = motion_data[0].forw_vert_f_code; 
back_hor_f_code = back_vert_f_code = 15; 
sxf = motiondatafO], sxf; 
syf = motion_data[0].syf; 

} 

} 

else 

{ 

/* B frame */ 
for(j=0;j<3;j++) 


{ 

neworg[j] = auxorgframe[j], 
newref[j] = auxframe(j]; 

} 

/* f frame number in display order */ 
f = i - 1; 

picttype = BTYPE; 

n = (i-2)%M + 1; I* first B: n=l, second B. n=2, ... *1 
forwhorfcode = motion_data[n].forvv_hor_f_code; 
forw_vert_f_code = motion_data[n].forw_vert_f_code; 
backhorfcode = motion_data[n].back_hor_f_code; 
backvertfcode = motion_data[n] back_vert_f_code; 
sxf = motion_data[n].sxf; 
syf = motion_data[n].syf; 
sxb = motion_data[n].sxb; 
syb = motion_data[n].syb; 

} 

tempref = f - fO; 

frame_pred_dct = frame_pred_dct_tab[pict_type-l]; 
q_scale_type = qscale_tab[pict_type-l], 
intravlc = intravlc_tab[pict_type-l]; 
altscan = altscan_tab[pict_type-l]; 

t* read in next picture to be coded */ 

sprintf(name,tplorg,f+fTameO); 

readfTame(name,neworg); 

/* for frame pictures only */ 

pictstruct = FRAMEPI CTURE ; 


/* record status data for parallel program 
that implements module ME 

*/ 


sprintf(name,tplorg,i); 

sprintf(namel,"c:\\nick\\mpeg\\dmot\\%s.sta M ,name); 
if (!(f_status = fopen(namel,"w/0"))) 

{ 

sprintfferrortext, "Couldn’t open %s\n",namel); 
error(errortext); 

} 

fprintf(f_status, "%d %d %d %d %d %d %d %d \n", 
pictjype, pict_struct, sxf, syf, sxb, syb, f, frame_pred_dct); 

fclose(f_status), 


/* record referenc frames for parallel program 


that implements module ME 


*/ 


sprintf(name,tplorg,f+frameO); 

sprintf(name 1 , "c:\\nick\\mpeg\\dmot\\%s.oof', name), 
if (!(f_d = fopen(namel,"wb"))) 

{ 

sprintf(erTOrtext,"Couldn't open %s\n",namel); 
error(errortext); 

} 

fwrite(oldorgframe[0], 1 ,vvidth*height,f_d); 
fclose(fd); 

sprintf(namel,"c:\\nick\\mpeg\\dmot\\%s.nof’,name); 
if (!(f_d = fopen(namel,"wb"))) 

{ 

sprintf(errortext, "Couldn't open %s\n",namel); 
error(errortext); 

} 

fwrite(neworgframe[0], 1 ,width*height,f_d); 
fclose(fd); 

sprintf(namel,"c:\\nick\\mpeg\\dmot\\%s.o£r,name); 
if (!(f_d = fopen(namel,"wb"))) 

{ 

sprintf(em>rtext, n Couldn , t open %s\n",namel); 
error(errortext); 

} 

fwrite(oldrefframe[0], 1 ,width*height,f_d); 
fclose(fd); 

sprintf(namel ) "c:\\nick\\mpeg\\dmot\\%s.nff' ) name); 
if (!(f_d = fopen(namel,"wb"))) 

{ 

sprintf(errortext,"Couldn't open %s\n",namel); 
error(errortext); 

} 

fwrite(newrefframe[0], 1 ,width*height,f_d); 
fclose(fd); 

sprintf(namel,"c:\\nick\\mpeg\\dmot\\%s.nor",name); 
if (!(f_d = fopen(namel,"wb"))) 

{ 

sprintf(en'ortext,"Couldn't open %s\n",namel); 
error(errortext); 

} 

fwrite(neworg[0], l,width*height,f_d); 
fclose(f_d); 


/* motion estimation ( ME module ) */ 

motion_estimation(oldorgframe[0], neworgframe[0], 
oldreflframe[0], newre£frame[0], 


neworgfO], 
sxf, syf, 
sxb, syb, 
mbinfo); 

/* prediction *1 

predict(oldreflTame,ne\vrefiTame,predfTame,0, mbinfo); 

/* transform ( TR module ) */ 
transform(predframe,neworg, mbinfo, blocks, 1 ); 

I* quantization & variable length coding */ 
putpict(neworg[0]); 

/* inverse quantization */ 
for (k=0; k<mb_height*mb_width; k++) 

{ 

if (mbinfofk] . mbtype & MBINTRA) 
for (j=0; j<block_count; j++) 

iquant_intra(blocksfk*block_count+j],blocks[k*block_count+j], 
dc_prec,intra_q,mbinfo[k] . mquant); 
else 

for (j=0 j<block_countj++) 

iquant_non_intra(blocks[k*block_count+j] ,blocks[k*block_count+j] , 
inter_q,mbinfo[k], mquant); 

} 

/* inverse transform */ 

transform(predframe,newref, mbinfo, blocks, 0); 

} 

putseqendO; 

} 


/ft********************************/ 
/* */ 

/* putseq2.q)p */ 

/* */ 

/*********************************/ 


^include <stdio.h> 

^include <string.h> 

#include "c:\nick\mpeg\org\config.h" 

#include "c\nick\mpeg\org\global.h" 

void putseqO 

{ 

/* this routine assumes (N % M) = 0 */ 
int i, j, k, f, fO, n, np, nb; 
int a, p, q; 

FILE *f_mbi, 

char name[128], namel[128]; 
unsigned char *neworg[3], *newref[3]; 

rc_init_seq(); /* initialize rate control */ 

/* sequence header, sequence extension and sequence display extension */ 

putseqhdr(); 

if (Impegl) 

{ 

putseqext(); 

putseqdispext(); 

} 

/* loop through all frames in encoding/decoding order */ 
for (i=0; i<nframes; i++) 

{ 

/* fO: lowest frame number in current GOP 

* 

* first GOP contains N-(M-1) frames, 

* all other GOPs contain N frames 
*/ 

fO = N*((i+(M-1))/N) - (M-l); 


if (f0<0) f0=0; 

if (i==0 || (i-l)%M=0) 

{ 

/* I or P frame */ 
for 0=O;j<3;j++) 

{ 

I* shuffle reference frames */ 
neworg[j] = oldorgframe[j]; 
newref[j] = oldrefframe[j]; 
oldorgframe[j] = neworgframe[j]; 
oldrefifame[j] = newrefiFrame[j], 
neworgframe[j] = neworg[j]; 
newrefframe[j] = newreflj]; 


} 


/* f: frame number in display order */ 
f = (i=0) ? 0 : i+M-1; 
if (f>=nframes) f = nframes - 1; 

if (i=fO) /* first displayed frame in GOP is I */ 

{ 

/* I frame */ 
picttype = ITYPE; 

forwhorfcode = forw_vert_f_code =15; 
backhorfcode = back_vert_f_code =15; 

/* n: number of frames in current GOP 

* 

* first GOP contains (M-l) less (B) frames 
*/ 

n = (i=0) ? N-(M-1) : N; 

/* last GOP may contain less frames */ 
if (n > nframes-fO) 
n = nframes -ft); 

/* number of P frames */ 
if (i=0) 

np = (n + 2*(M-1))/M - 1; /* first GOP */ 
else 

np = (n + (M-l))/M- 1; 

/* number of B frames */ 
nb = n - np - 1; 

rc_init_GOP(np,nb); 

putgophdr(ft),i=0); /* set closed_GOP in first GOP only */ 

} 

else 

{ 

/* P frame */ 
picttype = PTYPE; 

forwhorfcode = motion_data[0].forw_hor_f_code; 
forwvertfcode = motion_data[0].forw_vert_f_code; 
back hor f code = back vert f code =15; 

} 

} 

else 

{ 

/* B frame */ 
for(j=0;j<3;j-H-) 

{ 

neworg[j] = auxorgframe[j]; 
newref[j] = auxframe[j]; 

} 


/* f: frame number in display order */ 
f = i - 1; 

picttype = BTYPE; 

n = (i-2)%M -Hi;/* first B: n=l, second B: n=2, ... *1 
forw_hor_f_code = motion_data[n].fonv_hor_f_code; 
forwvertfcode = motion_data[n].forw_vert_f_code; 
back_hor_f_code = motion_data[n].back_hor_f_code; 
back_vert_f_code = motion_data[n].back_vert_f_code; 

} 

tempref = f - fO; 

frame_pred_dct = frame_pred_dct_tab[pict_type-l], 
q_scale_type = qscale_tab[pict_type-l], 
intravlc = intravlc_tab[pict_type-l]; 
altscan = al tscan_tab[pict_type- 1 ] ; 

/* read in next frame for coding */ 
sprintf(name,tplorg,f+frameO); 
readframe(name,neworg); 

/* for frame pictures only */ 

pictstruct = FRAMEPICTURE, 

/* read in data produced by the parallel program 
that implements module ME 

*1 

sprintf(name 1 ,"c:\\nick\\mpeg\\mot_out\\%s mbi",name); 
if (!(f_mbi = fopen(namel,"r"))) 

{ 

sprintf(errortext, "Couldn't open %s\n",namel); 
error(errortext); 

} 

a=0; 

for (p=0; p<mb_height2; p++) 

{ 

for (q=0; q<mb_width; q++) 

{ 

fscanf(f_mbi,"%d %d %d %d %d %d %d %d %d %d %d %d %d %d \n”, 
&mbinfo[a] . mbtype, 

&mbinfo[a] .motiontype, 

&mbinfo[a].MV[0][0][0], 

&mbinfo[a] .MV[0] [0] [ 1 ], 

&mbinfo[a].MV[0][l][0], 

&mbinfo[a] .MV[0] [ 1 ] [ 1 ] , 

&mbinfo[a] MV[1][0][0], 

&mbinfo[a].MV[l][0][l], 

&mbinfo[a].MV[l][l][0], 

&mbinfo[a].MV[l][l][l], 

&mbi nfo [a] . mv_field_sel [0] [0] , 

&mbinfo[a] . mvfieldsel [0] [ 1 ] , 

&mbinfo[a] . mv_field_sel[ 1 ] [0] , 

&mbinfo[a] . mv_field_sel [ 1 ] [ 1 ]); 


a++, 

} 

} 

fclose(fmbi); 

/* prediction */ 

predict(oldrefiFrame,newrefiFrame,predframe,0,mbinfo); 

/* transform (TR module) */ 

transfomi(predframe.neworg.mbinfo, blocks, 1 ); 

/* quantization & variable length coding */ 
putpict(neworg[0]); 

/* inverse quantization */ 

for (k=0; k<mb_height*mb_width, k++) 

{ 

if (mbinfo[k].mb_type & MBINTRA) 
for 0=0; j<block_count; j++) 

iquant_intra(b!ocks[k*block_count+j],blocks[k*block_count+jl, 

dc_prec,intra_q,mbinfo[k).mquant), 

else 

for 0=0 J<block_count j++) 

iquant_non_intra(blocks[k*block_count+j],blocks[k*block_count+j], 

inter_q,mbinfo[k].mquant); 

} 

/* inverse transform */ 

transform(predfTame,newref,mbinfo,blocks,0); 


} 

putseqend(); 

} 


/*********************************/ 
/* */ 

/* putseq4.cpp */ 

/* */ 

/*********************************/ 


#include <stdio.h> 

#include <string.h> 

#include "c:\nick\mpeg\org\config.h" 

^include "c:\nick\mpeg\org\global.h" 

void putseqO 

{ 

/* this routine assumes (N % M) = 0 */ 
int i, j, k, f, fD, n, np, nb; 
int a, p, q, b, c; 
int sxf, syf, sxb, syb; 

FILE *f_mbi, *f_d, *f_status, *f_blk; 
char name[128], namel[128]; 
unsigned char *neworg[3], *newrefI3]; 

rc init seqO; /* initialize rate control V 

/* sequence header, sequence extension and sequence display extension */ 

putseqhdrO; 

if (Impegl) 

{ 

putseqext(); 

putseqdispext(); 

} 


/* loop through all frames in encoding/decoding order */ 
for (i=0; i<nframes; i++) 

{ 

/* ID: lowest frame number in current GOP 
* 

* first GOP contains N-(M-1) frames, 

* all other GOPs contain N frames 

*1 

fO = N*((i+(M-1))/N) - (M-l); 


if (f0<0) f0=0; 

if (i=0 || (i-l)%M=0) 

{ 

/* I or P frame */ 
for (j=0;j<3;j-H-) 

{ 

/* shuffle reference frames */ 
neworg[j] = oldorgframe|j]; 
newreflj] = oldrefffamelj]; 
oldorgframe[j] = neworgframe[j]; 
oldrefframe[j] = newrefframe(j]; 


neworgframe[j] = neworg[j]; 
newre£Frame[j] = newref[j]; 

} 

/* f: frame number in display order */ 
f = (i=0) ? 0 : i+M-1; 
if (f>=nframes) f = nframes - 1 ; 

if (i=fD) /* first displayed frame in GOP is I */ 

{ 

/* I frame */ 
picttype = I_TYPE; 

forw_hor_f_code = forwvertfcode =15; 
backhorfcode = back_vert_f_code = 15; 

/* n: number of frames in current GOP 
* 

* first GOP contains (M-l) less (B) frames 

V 

n = (i=0)?N-(M-l) : N, 

/* last GOP may contain less frames */ 
if (n > nframes-fD) 
n = nframes-fD; 

/* number of P frames */ 
if (i=0) 

np = (n + 2*(M-1))/M - 1; /* first GOP */ 
else 

np = (n + (M-l))/M - 1, 

/* number of B frames */ 
nb = n - np - 1; 

rc_imt_GOP(np,nb); 

putgophdr(f0,i==0), /* set closed GOP in first GOP only */ 

} 

else 

{ 

/* P frame */ 
picttype = PTYPE; 

forwhorfcode = motion_data[0].forw_hor_f_code; 
forw_vert_f_code = motion_data[0].forw_vert_f_code; 
backhorfcode = back_vert_f_code =15; 
sxf = motion_data[0].sxf; 
syf = motion_data[0].syf; 

} 

} 

else 

{ 

/* B frame */ 
for (j=0;j<3;j-H-) 

{ 


neworg[j] = auxorgframe[j]; 
newreflj] = auxframe[j]; 

} 

/* f: frame number in display order */ 
f=i - 1; 

pict_type = B_TYPE; 

n = (i-2)%M + 1; /* first B: n=l, second B: n=2, ... */ 
forw_hor_f_code = motion_data[n].fonv_hor_f_code; 
forw_vert_f_code = motion_data[n].forw_vert_f_code; 
back_hor_f_code = motion_data[n].back_hor_f_code; 
back_vert_f_code = motion_data[n].back_vert_f_code; 
sxf = motion_data[n].sxf; 
syf = motion_data[n].syf; 
sxb = motion_data[n].sxb; 
syb = motion_data[n].syb; 

} 

temp_ref = f - fD, 

frame_pred_dct = frame_pred_dct_tab[pict_type-l]; 
q_scale_type = qscale_tab[pict_type-l]; 
intravlc = intravlc_tab[pict_type-l]; 
altscan = altscan_tab[pict_type-l]; 

/* read in next frame for coding */ 
sprintf(name,tplorg,f+frameO); 
readframe(name,neworg); 

/* for frame pictures only */ 

pictstruct = FRAMEPICTURE; 

/* motion estimation (module ME) */ 

motion_estimation(oldorgframe[0], neworgframe[0], 
oldrefframe[0], newrefframefO], 
neworg[0], 
sxf, syf, 
sxb, syb, 
mbinfo), 

/* predictionn */ 

predict(oldrefframe,newreflTame,predframe,0, mbinfo); 

/* write status data for the parallel program 
that implements module TR 

*/ 

sprintf(name,tplorg,i); 

sprintf(namel,"c:\\nick\\mpeg\\dtm\\%s.sta",name); 
if (!(f_status = fopen(namel,"w/0"))) 

{ 

sprintf(errortext,"Couldn't open %s\n",namel); 
error(errortext); 

} 

fprintf(f_status,"%d %d %d \n", f, pict_struct, frame_pred_dct); 



fclose(f_status); 


/* write image data for the parallel program 
that implements module TR 

*/ 

sprintf(name,tplorg,f+frameO); 

sprintf(namel,"c:\\nick\\mpeg\\dtrn\\%sy.nor",name), 
if (!(f d = fopen(namel,"wb”))) 

{ 

sprintf(errortext, "Couldn't open %s\n",namel); 
error(errortext); 

} 

fwrite(neworg[0], 1 ,width*height,f_d); 
fclose(f_d); 

sprintf(name 1 ,"c:\\nick\\mpeg\\dtm\\%su.nor",name); 
if (!(f d = fopen(namel,"wb"))) 

{ 

sprintf(errortext,"Couldn't open %s\n",namel); 
error(errortext); 

} 

fwrite(neworg[ 1], l,chrom_width*chrom_height,f_d); 
fclose(f_d); 

sprintf(namel, n c:\\nick\\mpeg\\dtm\\%sv.nor",name); 
if (!(f_d = fopen(namel,"wb"))) 

{ 

sprintf(eiTortext,"Couldn’t open %s\n",namel); 
erTor(errortext); 

} 

fwrite(neworg[2],l,chrom_width*chrom_height,f_d); 

fclose(f_d); 

sprintf(namel,"c:\\nick\\mpeg\\dtm\\%sy.prf',name); 
if(!(f d = fopen(namel,"wb"))) 

{ 

sprintf(errortext, "Couldn't open %s\n",namel); 
error(errortext), 

} 

fwrite(predframe[0], 1 ,width*height,f_d), 
fclose(f_d); 

sprintf(namel,"c:\\nick\\mpeg\\dtm\\%su.prr,name); 
if (!(f d = fopen(namel,"wb"))) 

{ 

sprintf(errortext > "Couldn't open %s\n",namel); 
error(errortext), 

} 

fwrite(predframe[ 1 ], 1 ,chrom_width*chrom_height,f_d), 
fclose(f_d); 

sprintf(namel,"c\\nick\\mpeg\\dtm\\%sv.prf',name); 


if (!(f_d = fopen(namel,"wb"))) 

{ 

sprintf(errortext, "Couldn't open %s\n",namel); 
error(errortext); 

} 

f\vrite(predframe[2], 1 ,chrom_width*chrom_height,f_d); 
fclose(f_d); 

/* transform (module TR) */ 

transform(predframe,nevvorg,mbinfo, blocks, 1 ); 

/* quantization & variable length coding */ 
putpict(neworg[0]), 

/* inverse quantization */ 

for (k=0; k<mb height*mb_width; k++) 

{ 

if (mbinfo[k].mb_type & MBINTRA) 
for (j=0, j<block_count; j++) 

iquant_intra(blocks[k*block_count+j] ,blocks[k*block_count+j] , 
dc_prec,intra_q,mbinfo[k].mquant); 
else 

for (j=OJ<block_countJ-H-) 

iquant_non_intra(blocks[k*block_count+j], blocks [k*block_count+j], 
inter_q,mbinfo[k] . mquant); 

} 

/* inverse transform */ 
transform(predframe,newref,mbinfo,blocks,0); 


putseqend(); 

} 


/*********************************/ 


/* 


V 

/* 

putseq5.cpp 

V 

/* 


*/ 


/*********************************/ 


#include <stdio.h> 

#include <string.h> 

^include "c:\nick\mpeg\org\config.h" 

#include "c:\nick\mpeg\org\global h" 

void putseqO 

{ 

/* this routine assumes (N % M) = 0 */ 

int i, j, k, f, fD, n, np, nb; 

int a, p, q, b, c; 

int sxf, syf, sxb, syb; 

FILE *f_mbi, *f_blk; 

char name[128], namel[128]; 

unsigned char *neworg[3], *newrefI3]; 

rc_init_seq(); /* initialize rate control */ 

/* sequence header, sequence extension and sequence display extension */ 

putseqhdr(); 

if (Impegl) 

{ 

putseqext(), 

putseqdispextO; 

} 

/* loop through all frames in encoding/decoding order */ 
for (i=0; i<nframes; i++) 

{ 

I* fO. lowest frame number in current GOP 

* 

* first GOP contains N-(M-l) frames, 

* all other GOPs contain N frames 

*/ 

10 = N*((i+(M-1))/N) - (M-l); 


if (f0<0) f0=0; 

if (i=0 || (i-l)%M==0) 

{ 

/* I or P frame */ 
for (j=0;j<3;j-H-) 

{ 

/* shuffle reference frames */ 
neworg[j] = oldorgframe[j], 
newreflj] = oldrefframe[j]; 
oldorgframe[j] = neworgframelj], 
oldrefframe[j] = newrefframe[j]; 
neworgframe[j] = neworgfj]; 


newrefframe[j] = newref[j]; 

} 

/* f: frame number in display order */ 
f = (i==0) ? 0 : i+M-1; 
if (f>=nframes) f = nframes - 1; 

if (i==fO) /* first displayed frame in GOP is 1 */ 

{ 

/* I frame */ 
pict_type = ITYPE; 

forw_hor_f_code = forwvertfcode =15; 
back_hor_f_code = back_vert_f_code =15; 

/* n: number of frames in current GOP 
* 

* first GOP contains (M-l) less (B) frames 
*/ 

n = (i=0) ? N-(M-1) : N; 


/* last GOP may contain less frames */ 
if (n > nframes-fD) n = nframes-fD; 

/* number of P frames *1 
if (i=0) 

np = (n + 2*(M-1))/M - 1; /* first GOP */ 
else 

np = (n + (M-l))/M - 1; 

/* number of B frames */ 
nb = n - np - 1; 

rc_init_GOP(np,nb); 

putgophdr(f0,i=0); /* set closed_GOP in first GOP only */ 

} 

else 

{ 

/* P frame */ 
picttype = PTYPE; 

forwhorfcode = motion_data[0].fonv_hor_f_code; 
forw_vert_f_code = motion_data[0].forw_vert_f_code; 
back_hor_f_code = back_vert_f_code = 15; 
sxf = motion_data[0].sxf; 
syf = motion_data[0].syf; 

} 

} 

else 

{ 

/* B frame */ 
for 0=O;j<3;j++) 

{ 

neworgjj] = auxorgframelj], 
newreflj] = auxframe[j]; 



} 


/* f: frame number in display order */ 
f = i * l; 

pict_type = B_TYPE; 

n = (i-2)%M + 1; /* first B' n=l, second B: n=2, ... */ 
fonv_hor_f_code = motion_data[n].forw_hor_f_code; 
forw__vert_f_code = motion_data[n].forw_vert_f_code; 
backJior_f_code = motion jiata[n], back Jior_f_code; 
back_vert_f_code = motion_data[n].back_vert_f_code; 
sxf = motion_data[n].sxf; 
syf = motion_data[n].syf; 
sxb = motion_data[n].sxb; 
syb = motion_data[n].syb; 

} 

tempref = f - fO; 

frame__pred_dct = frame_pred_dct_tab[pict_type-l], 
q_scale_type = qscale_tab[pict_type-l]; 
intravlc = intravlc_tab[pict_type-l]; 
altscan = altscan_tab[pict_type-l]; 

/* read in next frame for coding */ 
sprintf(name,tplorg,f+ffameO), 
readframe(name,neworg); 

/* for frame pictures only */ 
pict_struct = FRAME_PICTURE; 

/* motion estimation (module ME) */ 

motion_estimation(oldorgframe[0], neworgfirame[0], 
oldrefffame[0], newrefframe[0], 
neworg[0], 
sxf, syf, 
sxb, syb, 
mbinfo); 


/* prediction *1 

predict(oIdrefiFrame,newrefframe,predframe,0, mbinfo); 

/* read in status data produced by the parallel program 
that implements module TR 

*/ 

sprintf(namel,"cA\nick\\mpeg\\tm_out\\%s.mbi",name); 
if (!(f mbi = fopen(namel,"r"))) 

{ 

sprintf(errortext,"Couldn't open %s\n",namel); 
error(errortext); 

} 


sprintf(namel,"c:\\mck\\mpeg\\tm_out\\%s.blk",name); 
if f = fopen(namel,"r"))) 


{ 

sprintf(errortext, "Couldn't open %s\n",namel); 
error(eiToitext); 

} 

a=0; 

for (p=0; p<mb_height2; p++) 
for (q=0, q<mb_width; q++) 

{ 

fscanf(f_mbi,"%d ", &mbinfo[a].dct_type); 

for (b=0; b<block_count; b++) 
for (c=0; c<64; C++) 

fscanf(f_blk,"%d ", &blocks[a*block_count+b] [c] ); 


a++; 

} 

fclose(fmbi); 

fclose(fblk); 

/* quantization & variable length coding */ 
putpict(neworg[0]); 

/* inverse quantization *1 

for (k=0; k<mb_height*mb_width; k++) 

{ 

if (mbinfo[k].mb_type & MBINTRA) 
for 0=0; j<block_count; j++) 

iquant_intra(blocks[k*block_count+j],blocks[k*block_count+j], 

dc_prec,intra_q,mbinfo[k].mquant); 

else 

for 0=OJ<block_count j++) 

iquant_non_intra(blocks[k*block_count+j], blocks [k*block_count+j], 
inter_q,mbinfo[k].mquant); 

} 

/* inverse transform */ 

transform(predffame,newref,mbinfo, blocks, 0); 


} 

putseqend(); 

} 


/*********************«***********/ 
/* */ 

/* motion. cpp */ 

/* */ 

/♦♦♦it*****************************/ 


#include <stdio.h> 

^include "c:\nick\mpeg\org\config.h" 

#include ’’c:\nick\mpeg\org\gIobaI.h" 

/* private prototypes */ 

static void frame_estimate _ANSI_ARGS_((unsigned char *org, 
unsigned char *ref, 
unsigned char *mb, 
int i, 
intj, 
int sx, 
int sy, 
int *iminp, 
int *jminp, 
int *imintp, 
int *jmintp, 
int *iminbp, 
int *jminbp, 
float ♦dframep, 
float *dfieldp, 
int *tselp, int *bselp, 
int imins[2][2], 
int jmins[2][2])); 

static float fullsearch _ANSI_ARGS_((unsigned char *org, 
unsigned char *ref, 
unsigned char *blk, 
int lx, 
int iO, 
int jO, 
int sx, 
int sy, 
char h, 
int xmax, 
int ymax, 
int *iminp, 
int *jminp)); 

static float distl _ANSI_ARGS_((unsigned char *blkl, 
unsigned char *blk2, 
int lx, 
char hx, 
char hy, 
char h, 

float distlim)); 


static float dist2 _ANSI_ARGS_((unsigned char *blkl, 


unsigned char *blk2, 
int lx, 
char hx, 
char hy, 
char h)); 

static float bdistl _ANSI_ARGS_((unsigned char *pf, 
unsigned char *pb, 
unsigned char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
int hyb, 
char h)); 

static float bdist2 _ANSI_ARGS_((unsigned char *pf, 
unsigned char *pb, 
unsigned char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
int hyb, 
char h)); 

static float variance _ANSI_ARGS_((unsigned char *p, int lx)); 


/* 

* motion estimation for progressive and interlaced frame pictures 

* 

* oldorg: source frame for forward prediction (used for P and B frames) 

* neworg: source frame for backward prediction (B frames only) 

* oldref: reconstructed frame for forward prediction (P and B frames) 

* newref: reconstructed frame for backward prediction (B frames only) 

* cur: current frame (the one for which the prediction is formed) 

* sxf.syf: forward search window (frame coordinates) 

* sxb,syb: backward search window (frame coordinates) 

* mbi: pointer to macroblock info structure 

* 

* results: 

* mbi-> 

* mb type: 0, MB INTRA, MBFORWARD, MB BACKWARD, MB_FORWARD|MB_BACKWARD 

* MVUfln- motion vectors (frame format) 

* mv_field_sel: top/bottom field (for field prediction) 

* motion_type: MC_FRAME, MC_FIELD 

* 

* uses global vars: pict_type, frame_pred_dct 

*/ 

void motion_estimation(unsigned char *oldorg, unsigned char *neworg, 
unsigned char *oldref, unsigned char * newref, 
unsigned phar *cur, 
int sxf, int syf, 


int sxb, int syb, 
struct mbinfo *mbi) 

{ 

int i, j; 

int iminjmin,iminfjminf,iminrjnunr; 
int imintjmint,iminbjminb; 
int imintfjmintfiiminbfjminbf; 
int imintrjmintr.iminbrjminbr, 
float var,vO; 

float dmc,dmcf,dincr,(lmci,vmc,vnicf,vmcr,vmci; 
float dmcfield,dnicficldf,dmcfieldr,dmcfieldi , 
int tsel,bsel,tself,bself,tselr,bse!r; 
unsigned char *mb; 
int imins[2][2]jmins[2][2]; 


/* loop through all macroblocks of the picture */ 

for (j=0, jcheight 2; j+=16) 

{ 

for (i=0; i<width; i+=16) 

{ 

/* if (pict_struct=FRAME_PICTURE) */ 

mb = cur + i + width*j; 

var = variance(mb, width); 

/* for I_type pictures */ 

if (pict_type=I_TYPE) 
mbi->mb_type = MBINTRA; 


/* for P type pictures */ 

if (pict_type=P_TYPE) 

{ 

frame_estimate(oldorg,oldref,mb,ij,sxf,syf, 

&imin,&jmin,&imint,&jmint > &iminb,&jminb, 

&dmc,&dmcfield,&tsel,&bsel,iminsjmins); 

/* select between frame and field prediction */ 
if (dmc<=dmcfield) 

{ 

mbi->motion_type = MC_FRAME; 
vmc = dist2(oldref+(imin»l)+width*(jmin»l),mb, 
width,imin& 1 jmin& 1,16); 

} 

else 

{ 

mbi->motion_type = MC_FIELD; 
dmc = dmcfield; 


vntc = disl2(oldref+(tsel?width:0)+(imint»l )+(width« 1 )*(jmint» 1), 
mb,width« 1 ,imint& 1 jmint& 1 ,8); 

vmc+= dist2(oldref+(bsel?width:0)+(iminb» 1 )+(width« l)*(jminb» 1 ), 
mb+ width, width« 1 ,iminb& 1 jminb& 1 ,8); 


/* select between intra or non-intra coding: 

* 

* selection is based on intra block variance (var) vs 

* prediction error variance (vine) 

* 

* blocks with small prediction error are always coded non-intra 

* even if variance is smaller (is this reasonable?) 

V 

if (vmc>var && vmc>=9*256) 
mbi->mb_type = MB_INTRA; 
else 
{ 

/* select between MC / No-MC 

* 

* use No-MC if var(No-MC) <= 1.25*var(MC) 

* (i.e slightly biased towards No-MC) 

* 

* blocks with small prediction error are always coded as No-MC 

* (requires no motion vectors, allows skipping) 

*/ 

vO = dist2(o!dref+i+width*j,mb,width,0,0,16); 
if (4*v0>5*vmc && v0>=9*256) 

{ 

/* use MC */ 
var = vmc; 

mbi->mb_type = MBFORWARD; 
if (mbi->motion_type=MC_FRAME) 

{ 

mbi->MV[0][0][0] = imin - (i«l); 
mbi->MV[0][0][l] = jmin - (j«l); 

} 

else /* if (mbi->motion_type— MC_FEELD) */ 

{ 

/* these are FRAME vectors */ 
mbi->MV[0][0][0] = imint - (i«l); 
mbi->MV[0][0][l] = (jmint«l) - (j«l); 
mbi->MV[l)[0][0] = iminb - (i«l); 
mbi->MV[l][0][l] = (jminb«l) - (jeel); 
mbi->mv_field_sel[0][0] = tsel; 
mbi->mv_field_sel[l][0] = bsel, 

} 

> 

else 

{ 

/* No-MC */ 
var = vO; 


mbi->mb_type = 0, 
mbi->motion_type = MC_FRAME, 
mbi->MV[0][0][0] = 0; 
mbi->MV[0][0][l] = 0; 

} 

} 

} 


/* for Btype pictures */ 

if (pict_type==B_TYPE) 

{ 

/* forward prediction *1 
frame_estimate(oldorg,oldref,mb,ij,sxf,syf, 

&iminf,&jminf,&imintf,&jmintf,&iminbf,&jminbf, 

&dmcf,&dmcfieldf,&tself,&bself,iminsjmins); 

/* backward prediction */ 
frame_estimate(neworg,newref,mb,ij,sxb,syb, 

&iminr,&jminr,&imintr,&jmintr,&iminbr,&jminbr, 

&dmcr,&dmcfieldr,&tselr,&bselr,iminsjmins); 

/* calculate interpolated distance */ 

/* frame */ 

dmci = bdistl(oldref+(iminf»l)+width*(jminf»l), 
newref+(iminr» 1 )+width*(jminr» 1 ), 
mb, width, iminf& 1 jminf& 1 ,iminr& 1 jminr& 1,16); 

/* top field */ 

dmcfieldi = bdistl(oldref+(imintf»l)+(tself?width:0)+(width«l)*(jmintf»l), 
newref+(imintr» 1 )+(tselr?width:0)+(width« 1 )*(jmintr» 1 ), 
mb,width« 1 ,imintf& 1 jmintf& 1 ,imintr& 1 jmintr& 1 ,8); 

/* bottom field */ 

dmcfieldi+= bdist 1 (oldref+(iminbf» 1 )+(bself?width:0)+(width« l)*(jminbf» 1 )» 
newref+(iminbr» l)+(bselr?width:0)+(width« 1 )*(jminbr» 1 ), 
mb+width,width« l,iminbf& 1 jminbf& 1 ,iminbr& 1 jminbr& 1,8), 

/* select prediction type of minimum distance from the 
* six candidates (field/frame * forward/backward/interpolated) 

*/ 

if (dmci<dmcfieldi && dmci<dmcf && dmci<dmcfieldf 
&& dmci<dmcr && dmci<dmcfieldr) 

{ 

/* frame, interpolated */ 

mbi->mb_type = MB_FORWARD|MB_BACKWARD; 
mbi->motion_type = MC_FRAME, 
vmc = bdist2(oldref+(iminf»l)+width*(jminf»l), 
newref+(iminr» 1 )+width*(jminr» 1 ), 
mb, width, iminf& 1 jminf& 1 ,iminr& 1 jminr& 1 , 16); 

} 

else if (dmcfieldi<dmcf && dmcfieldi<dmcfieldf 


&& dmcfieldi<dmcr && dmcfieldi<dmcfieldr) 

{ 

I* field, interpolated */ 

mbi->mb_type = MB_FORWARD|MB_BACKWARD; 
mbi->motion_type = MC_FIELD; 

vmc = bdist2(oldref+(imintf»l)+(tself?width:0)+(width«l)*(jmintf»l), 
newref+(imintr»l)+(tselr?width:0)+(width«l)*(jniintr»l), 
mb,wtdth« 1 ,imintf& 1 jmintf& 1 ,imintr& 1 jmintr& 1 ,8); 
vmc+= bdist2(oldref+(iminbf» 1 )+(bself?width:0)+(width« l)*(jminbf» 1 ), 
newref+(iminbr» 1 )+(bselr?width:0)+(width« 1 )*(jminbr» 1 ), 
mb+width,width« 1 ,iminbf& 1 jminbf& 1 ,iminbr& 1 jminbr& 1,8); 

} 

else if (dmcf<dmcfieldf && dmcf<dmcr && dmcf<dmcfieldr) 

{ 

/* frame, forward */ 
mbi->mb_type = MB_FORWARD; 
mbi->motion_type = MC_FRAME; 
vmc = dist2(oldref+(iminf»l)+width*(jminf> > l),mb, 
width, imi nf& 1 jminf& 1 , 16); 

} 

else if (dmcfieldf<dmcr && dmcfieldf<dmcfieldr) 

{ 

/* field, forward */ 
mbi->mb_type = MBFORWARD; 
mbi->motion_type = MC_FIELD; 

vmc = dist2(oldref+(tself7width:0)+(imintfi»l)+(width«l)*(jmintf»l), 
mb,width« 1 ,imintf& 1 jmintf& 1,8); 

vmc+= dist2(oldref+(bself?width:0)+(iminbf» l)+(width« l)*(jminbf» 1), 
mb+width, width« 1 ,iminbf& 1 jminbf& 1,8), 

} 

else if (dmcr<dmcfieldr) 

{ 

/* frame, backward */ 
mbi->mb_type = MBBACKWARD; 
mbi->motion_type = MCFRAME; 
vmc = dist2(newref+(iminr»l)+width*(jminr»l),mb, 
width, imi nr& 1 jminr& 1 , 16); 

} 

else 

{ 

I* field, backward */ 
mbi->mb_type = MBBACKWARD; 
mbi- > motion_tyj>e = MCFIELD; 

vmc = dist2(newTef+(tselr?width:0)+(imintr»l)+(width«l)*(jmintr»l), 
mb,width« 1 ,imintr& 1 jmintr& 1,8); 

vmc+= dist2(newref+(bselr?width:0)+(iminbr» 1 )+(width« l)*(jminbr» 1 ), 
mb+width, width« 1 ,iminbr& 1 jminbr& 1 ,8); 


/* select between intra or non-intra coding: 

* 

* selection is based on intra block variance (var) vs. 

* prediction error variance (vmc) 


* blocks with small prediction error are always coded non-intra 

* even if variance is smaller (is this reasonable?) 

*/ 

if (vmc>var && vmc>=9*256) 
mbi->mb_type = MB_INTRA; 
else 
{ 

var = vmc; 

if (mbi->motion_type— MC_FRAME) 

{ 

/* forward */ 

mbi->MV[0][0][0] = iminf - (i«l); 
mbi'>MV[0][0][l] = jminf - (j <<: 1); 

/* backward */ 

mbi->MV[0][l][0] = iminr - (i«l); 
mbi'>MV[0][l][l] = jminr - <j«l); 

} 

else 

{ 

/* these are FRAME vectors */ 

/♦ forward */ 

mbi->MV[0][0][0] = imintf - (i«l); 
mbi->MV[0] [0] [ 1 ] = (jmintf«l) - 0«1); 
mbi->MV[l][0][0] = iminbf - (i«l); 
mbi->MV[l][0][l) = (jminbf«l) - (j«l); 
mbi->mv_field_sel[0][0] = tself; 
mbi'>mv_field_sel[l][0] = bself; 

/* backward */ 

mbi->MV[0][l][0] = imintr - (i«l); 
mbi->MV[0][l][l] = (jmintr«l) - (j«l); 
mbi->MV[l][l][0] = iminbr - (i«l); 
mbi->MV[l][l][l] = (jminbr«l) - (j <<: l); 
mbi->mv_field_sel[0][l] = tselr; 
mbi*>mv_field_sel[l][l] = bselr; 

} 

} 

} 

mbi->var = var; 


mbi++; 

} 

if (Iquiet) 

{ 

putc('.’,stderr), 

fflush(stderr); 

} 


} 


if (Iquiet) putcC\n',stderr); 

} 


/* 

* frame picture motion estimation 

* 

* org- top left pel of source reference frame 

* ref: top left pel of reconstructed reference frame 

* mb: macroblock to be matched 

* ij: location of mb relative to ref (=center of search window) 

* sx,sy. half widths of search window 

* iminpjminp,dframep: location and value of best frame prediction 

* imintpjmintp.tselp: location of best field pred. for top field of mb 

* iminbpjminbp,bselp: location of best field pred. for bottom field of mb 

* dfieldp: value of field prediction 
*/ 

static void fTame_estimate(unsigned char *org, unsigned char *ref, 
unsigned char *mb, 
int i, int j, 
int sx, int sy, 
int *iminp, int *jminp, 
int *imintp, int *jmintp, 
int *iminbp, int *jminbp, 
float *dframep, float *dfieldp, 
int *tselp, int *bselp, 
int imins[2][2], int jmins[2][2] ) 

{ 

float dt,db,dmint,dminb; 

int imint,iminbjmintjminb; 

/* frame prediction */ 

♦dframep = fullsearch(org,ref,mb,width,ij,sx,sy, 16, width, height, 
iminpjminp); 

/* predict top field from top field */ 

dt = fullsearch(org,ref,mb,width«l,ij»l,sx,sy»l,8,width,height»l, 
&imint,&jmint); 

/* predict top field from bottom field */ 

db = fullsearch(org+width,ref+width,mb,width« 1 ,i j»l ,sx,sy» 1 ,8,width,height» 1 , 
&iminb,&jminb); 

imins[0][0] = imint; 
jmins[0][0] = jmint; 
imins[l][0] = iminb; 
jmins[l][0] = jminb; 

/* select prediction for top field */ 
if (dt<=db) 

{ 

dmint=dt; *imintp=imint; *jmintp=jmint; *tselp=0; 

} 

else 



{ 

dmint=db; *imintp=iminb; *jmintp=jminb; *tselp=l; 

} 

/* predict bottom field from top field */ 

dt = fullsearch(org > ref,mb+width,width« 1 ,i j» 1 ,sx,sy» 1 ,8,width,height» 1 , 
&imint,&jmint); 

/* predict bottom field from bottom field */ 
db = fiillsearch(org+width, ref+width, mb+width, 

width«l, i, j»l, sx, sy»l, 8, width, height»l, 

&iminb,&jminb); 

imins[0][l] = imint; 
jmins[0][l] = jmint; 
imins[l][l] = iminb; 
jmins[l][l] =jminb; 

/* select prediction for bottom field */ 
if (db<=dt) 

{ 

dminb=db, *iminbp=iminb; *jminbp=jminb; *bselp=l, 

} 

else 

{ 

dminb=dt; *iminbp=imint; *jminbp=jmint; *bselp=0; 

} 

*dfieldp=dmint+dminb; 

} 


/* 

* full search block matching 

* 

* bik: top left pel of (16*h) block 

* h: height of block 

* lx: distance (in bytes) of vertically adjacent pels in ref, bik 

* org: top left pel of source reference picture 

* ref: top left pel of reconstructed reference picture 

* iO jO: center of search window 

* sx,sy: half widths of search window 

* xmax,ymax: right/bottom limits of search area 

* iminp jminp: pointers to where the result is stored 

* result is given as half pel offset from ref(0,0) 

* , i.e. NOT relative to (iO jO) 

*/ 

static float fullsearch(unsigned char *org, unsigned char *ref, 
unsigned char *blk, 
int lx, int iO, int jO, 
int sx, int sy, 
char h, 

int xmax, int ymax. 


int *iminp. int *jminp) 

{ 

int ij,iminjmin,ilow,ihighjlowjhigh; 
float d,dmin; 
int k,l,sxy; 


ilow = iO - sx; 
ihigh = iO + sx; 


if (ilow<0) 
ilow = 0; 

if (ihigh>xmax-16) 
ihigh = xmax-16; 


jlow=jO-sy; 
jhigh = jO + sy; 

if (jlow<0) 
jlow = 0; 

if (jhigh>ymax-h) 
jhigh = ymax-h; 

/* full pel search, spiraling outwards */ 

imin = iO; 
jmin =j0; 

dmin = distl(org+imin+lx*jmin,blk,lx,0,0,h,65536.0); 

sxy = (sx>sy) ? sx : sy, 

for (1=1; l<=sxy; 1++) 

{ 

i = iO - 1; 
j = jO - 1; 

for (k=0, k<8*l; k++) 

{ 

if (i>=ilow && i<=ihigh && j>=jlow && j<=jhigh) 

{ 

d = distl(org+i+lx*j,blk,lx,0,0,h,dmin); 

if (d<dmin) 

{ 

dmin = d; 
imin = i; 
jmin = j; 

} 

} 

if (k<2*l) i++; 
else if (k<4*l) j++; 
else if (k<6*l) i— ; 
else j--; 



} 

} 

/* half pel */ 

dmin = 65536.0; 

imin «= 1; 

jmin «= 1; 

ilow = inun - (imin>0); 

ihigh = imin + (imin<((xmax-16)«l)); 

jlow = jmin - (jmin>0); 

jhigh = jmin + (jmin<((ymax-h)«l)); 


for (j=jlow; j<=jhigh; j++) 
for (i=ilow; i<=ihigh; i-H-) 

{ 

d = distl(ref+(i»l)+lx*(|»l),blk,lx,i&l j&l,h,dmin), 

if (d<dmin) 

{ 

dmin = d; 
imin = i; 
jmin = j; 

} 

} 

♦iminp = imin; 

♦jminp = jmin; 

return dmin; 

} 

/* 

* total absolute difference between two (16*h) blocks 

* including optional half pel interpolation ofblkl (hx,hy) 

* blkl,blk2: addresses of top left pels of both blocks 

* lx: distance (in bytes) of vertically adjacent pels 

* hx,hy: flags for horizontal and/or vertical interpolation 

* h: height of block (usually 8 or 16) 

* distlim: bail out if sum exceeds this value 
V 

static float distl (unsigned char *blkl, 
unsigned char *blk2, 
int lx, 
char hx, 
char hy, 
char h, 
float distlim ) 

{ 

unsigned char *pl,*pla,*p2; 
char i, j; 
int v; 
float s; 

s = 0, 


pi = blkl; 
p2 - blk2; 


if (!hx && !hy) 
for (j=0; j<h; j++) 

{ 

if ((v = pi [0] - p2[0])<0) v = -v; s+= (float) v; 
if((v = pl[l] -p2[l])<0) v = -v; s+= (float) v; 
if ((v = pi [2] - p2[2])<0) v = -v; s+= (float) v; 
if ((v = pi [3] -p2[3])<0) v = -v; s+= (float) v; 
if((v = pl[4] -p2[4])<0) v = -v; s+= (float) v; 
if ((v = pi [5] - p2[5])<0) v = -v; s+= (float) v; 

if ((v = pi [6] - p2[6])<0) v = -v; s+= (float) v; 

if ((v = pi [7] - p2[7])<0) v = -v; s+= (float) v; 

if ((v = pi [8] - p2[8])<0) v = -v; s+= (float) v; 

if ((v = pi [9] - p2[9])<0) v = -v; s+= (float) v; 

if ((v = pi [10] - p2[10])<0) v = -v; s+= (float) v; 
if ((v = pi [1 1] - p2[l 1])<0) v = -v; s+= (float) v; 
if ((v = pl[12] - p2[12])<0) v = -v; s+= (float) v; 
if ((v = pl[13] - p2[13])<0) v = -v; s+= (float) v; 
if ((v = p 1 [ 1 4] - p2[14])<0) v = -v; s+= (float) v; 
if ((v = pi [15] - p2[15])<0) v = -v; s+= (float) v; 

if (s >= distlim) 
break; 


pl+= lx; 
p2+= lx; 

} 

else if (hx && !hy) 
for (j=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (pl[i]+pl[i+l]+l)»l - p2[i]; 
if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

pl+= lx; 
p2+= lx; 

} 

else if (!hx && hy) 

{ 

pla = pl +lx; 
for (j=0; j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (pl[i]+pla[i]+l)»l - p2[i]; 
if (v>=0) 
s+= (float) v; 
else 


s-= (float) v; 


} 

pi =pla; 
pla+= lx; 
p2+= lx, 

} 

} 

else /* if (hx && hy) */ 

{ 

pla = pi + lx; 
for (j=0;j<h;j++) 

{ 

for (i=0; i<16, i++) 

{ 

v = (pl[i]+pl[i+l]+pla[i]+pla[i+l]+2)»2 - p2[i]; 
if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 

} 

return s; 

} 

/* 

* total squared difference between two (16*h) blocks 

* including optional half pel interpolation of blkl (hx,hy) 

* blkl,blk2: addresses of top left pels of both blocks 

* lx: distance (in bytes) of vertically adjacent pels 

* hx,hy. flags for horizontal and/or vertical interpolation 

* h: height of block (usually 8 or 16) 

*/ 

static float dist2(unsigned char "‘blkl, 
unsigned char *blk2, 
int lx, 
char hx, 
char hy, 
char h ) 

{ 

unsigned char *pl,*pla,*p2; 
char i, j; 
int v; 
float s; 

s = 0, 
pi = blkl; 
p2 = blk2; 
if(!hx&&!hy) 
for (j=0; j<h; j++) 


{ 

for (i=0; i<16; i++) 

{ 

v = plfij -p2[i]; 
s+= (float) (v*v); 

} 

pl+= lx; 
p2+= lx; 

} 

else if (hx && !hy) 
for (j=0; j<h; j++) 

{ 

for (i=0, i<16; i++) 

{ 

v = (pl[i]+pl[i+l]+l)»l-p2[i]; 
s+= (float) (v*v); 

} 

pl+= lx; 
p2+= lx; 

} 

else if (!hx && hy) 

{ 

pla = pi + lx; 
for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v * (pi [i]+pla{i]+l)»l - p2[ij; 
s+= (float) (v*v); 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 

} 

else /* if (hx && hy) */ 

{ 

pla = pi + lx; 
for (j=0, j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (pl[i]+pl[i+l]+pla[i]+pla[i+l]+2)»2 - p2[i]; 
s+= (float) (v*v); 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 


return s; 

} 



/* 

* absolute difference error between a (16*h) block and a bidirectional 

* prediction 

* 

* p2: address of top left pel of block 

* pf,hxf,hyf: address and half pel flags of forward ref. block 

* pb,hxb,hyb address and half pel flags of backward ref block 

* h- height of block 

* lx: distance (in bytes) of vertically adjacent pels in p2,pf,pb 
*/ 

static float bdistl (unsigned char *pf, unsigned char *pb, unsigned char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

unsigned char *pfa,*pfb,*pfc,*pba,*pbb,*pbc; 
char i, j; 
intv; 
float s; 

pfa = pf + hxf, 
pfb = pf + lx*hyf; 
pfc = pfb + hxf, 

pba = pb + hxb; 
pbb = pb + lx*hyb, 
pbc = pbb + hxb; 

s = 0; 

for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = ((*pf++ + *pfa++ + *pfb++ + *pfc++ + 2)»2 + 

(*pb++ + *pba++ + *pbb++ + *pbc++ + 2)»2 + 1)»1 
- *p2++, 

if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

p2+= lx- 16; 
pf+= lx-16; 
pfa+= lx-16, 
pfb+= lx-16; 
pfc+= lx-16; 
pb+= lx-16; 
pba+= lx-16; 
pbb+= lx-16; 
pbc+= lx-16; 

} 


return s; 

} 

/* 

* squared error between a (16*h) block and a bidirectional 

* prediction 

* 

* p2: address of top left pel of block 

* pf,hxf,hyf: address and half pel flags of forward ref. block 

* pb,hxb,hyb: address and half pel flags of backward ref. block 

* h: height of block 

* lx: distance (in bytes) of vertically adjacent pels in p2,pf,pb 
*/ 

static float bdist2(unsigned char *pf, unsigned char *pb, unsigned char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

unsigned char *pfa,*pfb,*pfc,*pba,*pbb,*pbc; 
char i, j; 
int v; 
float s; 


pfa = pf + hxf; 
pfb = pf + lx*hyf; 
pfc = pfb + hxf; 


pba = pb + hxb; 
pbb = pb + lx*hyb; 
pbc = pbb + hxb; 

s = 0; 

for (j=0; j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = ((*pf++ + *pfa++ + *pfb++ + * p fc++ + 2)»2 + 

(*pb++ + *pba++ + *pbb++ + *pbc++ + 2)»2 + 1)»1 
- *p2++; 

s+= (float) (v*v); 

} 

p2+= lx- 16; 
pf+= lx-16; 
pfa+= lx-16; 
pfb+= lx-16; 
pfc+= lx-16; 
pb+= lx-16; 
pba+= lx-16, 
pbb+= lx-16, 
pbc+= lx-16; 

} 


return s; 


} 


/* 

* variance of a (16* 16) block, multiplied by 256 

* p: address of top left pel of block 

* lx: distance (in bytes) of vertically adjacent pels 
*/ 

static float variance(unsigned char *p, int lx ) 

{ 

char ij; 
unsigned int v; 
float s, s2; 

s = s2 = 0, 

for O=0;j<16;j++) 

{ 

for (i=0; i<16, i++) 

{ 

v = *p++; 
s+= (float) v; 
s2+= (float) (v*v); 

} 

p+= lx-16; 

} 

s=0.0625*s; 
return s2 - s*s, 

} 


/*********************************/ 


/* 


*/ 

/* 

transfrm.cpp 

*/ 

/* 


*/ 


/*********************************/ 

#include <stdio.h> 

#include <math.h> 

^include "c:\nick\mpeg\org\config.h M 
#include "c:\nickVmpeg\org\global.h" 


/* private prototypes*/ 

static void add_sub_pred _ANSI_ARGS_((unsigned char *pred, 
unsigned char *cur, 
int lx, short *blk, 
char add_sub)); 

/* private prototypes */ 

static void predict_mb _ANSI_ARGS_((unsigned char *oldref[], 
unsigned char *newrefl], 
unsigned char *cur[], 
int lx, int bx, int by, 
int pict_type, int pict_struct, 
int mb_type, int motion type, int secondfield, 
int PMV[2][2][2], 
int mv_field_sel[2][2], 
int dmvector[2])); 

static void pred _ANSI_ARGS_((unsigned char *src[], 
int sfield, 

unsigned char *dst[], 
int dfield, 

int lx, int w, int h, int x, int y, 
int dx, int dy, int addflag)); 

static void pred_comp _ANSI_ARGS_((unsigned char *src, unsigned char *dst, 
int lx, int w, int h, int x, int y, int dx, int dy, int addflag)); 

static void calc_DMV _ANSI_ARGS_((int DMVU[2], 
int *dmvector, 
int mvx, int mvy)); 

static void clearblock _ANSI_ARGS_((unsigned char *cur[], int iO, int jO)); 

/* 

* select between frame and field DCT 

♦ 

* preliminary version: based on inter-field correlation 

*/ 

/* subtract prediction and transform prediction error */ 

/* inverse transform prediction error and add prediction */ 

void transform(unsigned char *pred[], 
unsigned char *cur[], 


struct mbinfo *mbi. 
short b!ocks[][64], 
char forward_invers ) 

{ 

intij, il.jl.k, n, cc, offs, lx; 

/* 

* select between frame and field DCT 

* 

* preliminary version: based on inter-field correlation 

*/ 

if (forwardjnvers) 

dct_type_estimation(pred[0], cur[0], mbi ) ; 

/* subtract prediction and transform prediction error */ 

/* inverse transform prediction error and add prediction */ 

k = 0; 

for (j=0; j<height2; j+=16) 
for (i=0, i<width; i+=16) 

{ 

for (n=0; n<block_count; n++) 

{ 

cc = (n<4) ? 0 : (n&l)+l, /* color component index */ 
if (cc=0) 

{ 

/* luminance */ 

if ((pict_struct=FRAME_PICTURE) && mbifk].dct_type) 

{ 

/* field DCT */ 

offs = i + ((n&l)«3) + width*(j+((n&2)» 1 )); 
lx = width«l; 

} 

else 

{ 

/* frame DCT */ 

offs = i + ((n&l)«3) + width2*(j+((n&2)«2)); 
lx = width2, 

} 

if (pict_struct=BOTTOM_FIELD) 
offs += width; 

} 

else 

{ 

/* chrominance */ 

/* scale coordinates */ 

il = (chroma_format=CHROMA444) ? i : i»l; 
jl = (chroma_format!=CHROMA420) ?j • j»l; 

if ((pict_struct=FRAME_PICTURE) && mbi(k].dct_type 
&& (chroma_format!=CHROMA420)) 


{ 

/* field DCT */ 

offs = il + (n&8) + chrom_width*(jl+((n&2)»l)); 
lx = chrom_width«l; 

} 

else 

{ 

/* frame DCT */ 

offs = il + (n&8) + chrom_width2*(jl+((n&2)«2)), 
lx = chrom_width2; 

} 

if (pict_struct=BOTTOM_FIELD) 
offs += chrom_width; 

} 

if (forward_invers) /* 1 for forw ard DCT */ 

{ 

add_sub_pred(pred[cc]+offs, 

cur[cc]+offs, 

lx, 

blocks [k*block_count+n], 

0 ); 

fdct(blocks[k*block_count+n]); 

} 

else I* 0 for invers DCT */ 

{ 

idct(blocks[k*block_count+n]); 

add_sub_pred(pred[cc]+offs, 

cur[cc]+offs, 

lx, 

blocks [k*block_count+n], 

i); 

} 

} 

k-H-; 

} 

} 


/* add prediction and prediction error, saturate to 0...255 */ 
/* subtract prediction from block data */ 
static void add_sub_pred(unsigned char *pred, 
unsigned char *cur, 
int lx, short *blk, 
char addsub ) 

{ 

int i, j; 

for(j=0;j<8;j++) 

{ 

for (i=0; i<8; i++) 


{ 

if (add sub) /* 1 for addition */ 
cur[i] = clp[ blk[i] + pred[i] ], 
else /* 0 for subtractction */ 
blk[i] = cur[i] - pred[i]; 

} 

blk+= 8; 
cur+= lx; 
pred+= lx; 

} 


I* 

* select between frame and field DCT 

* 

* preliminary version based on inter-field correlation 

*/ 

void dct_type_estimation(unsigned char *pred. 
unsigned char *cur, 
struct mbinfo *mbi ) 

{ 

short blk0[128], blkl [ 128]; 

int i, j, iO, jO, k, offs, sO, si, sqO, sql, sOl; 

double d, r; 

k = 0; 

for 00=0; j0<height2; j0+=16) 
for (i0=0; i0<width; i0+=16) 

{ 

if (ffame_pred_dct || pict_struct!=FRAME_PICTURE) 
mbi[k].dct_type = 0; 
else 
{ 

/* interlaced frame picture *1 

/* 

* calculate prediction error (cur-pred) for top (blkO) 

* and bottom field (blkl) 

*/ 

for O^.j^ij-H-) 

{ 

offs = width*(0«l)+j0) + iO; 
for (i=0; i<16; i++) 

{ 

blk0[16*j+i] = cur[offs] - predjoffs]; 
blkl[16*j+i] = cur[ofifs+width] - pred[offs+width]; 
offs++; 

} 

} 

/* correlate fields */ 
s0=s 1 =sq0=sq 1 =s0 1 =0; 


for (i=0; i<128; i++) 

{ 

s0+= blkO[i]; 
sqO+= blkO[i]*blkO[i]; 
sl+= blklfi]; 
sql+= blk 1 [i] *blk 1 [i] ; 
s01+= blkO[i]*blkl[i]; 

} 

d = (sq0-(s0*s0)/128.0)*(sql-(sl*sl)/128.0); 

if (d>0.0) 

{ 

r = (s01-(s0*sl)/128.0)/sqrt(d); 
if (r>0.5) 

mbi[k].dct_type = 0; /* frame DCT */ 
else 

mbi[k].dct_type = 1; /* field DCT */ 

} 

else 

mbi[k].dct_type = 1; /* field DCT */ 

} 

k++; 

} 


/* fdctref.c, forward discrete cosine transform, double precision */ 

#ifndef PI 

# ifdef M_PI 

# define PI M_PI 

# else 

# define PI 3.14159265358979323846 

# endif 
#endif 


/* private data *1 

static double c[8][8]; /* transform coefficients */ 

void initfdctO 

{ 

int i, j; 
double s; 

for (i=0; i<8; i++) 

{ 

s = (i=0) ? sqrt(0. 125) : 0.5; 

for 0=0) j<8; j++) 
c[i][j] = s * cos((PI/8.0)*i*(j+0.5)>, 

} 

} 



void fdct(short *block) 

{ 

int i, j, k; 
double s; 
double tmp[64]; 

for (i=0; i<8; i++) 
for O=0;j<8;j-H-) 

{ 

s = 0.0; 

for (k=0; k<8; k++) 
s += c[j][k] * block[8*i+k]; 


tmp[8*i+j] = s; 

} 

for (j=0;j<8;j++) 
for (i=0; i<8; i++) 

{ 

s = 0.0; 

for (k=0, k<8, k++) 
s += c[i][k] * tmp[8*k+j]; 

block[8*i+j] = (int)floor(s+0.499999); 

/* 

* reason for adding 0.499999 instead of 0.5. 

* s is quite often x.5 (at least for i and/or j = 0 or 4) 

* and setting the rounding threshold exactly to 0.5 leads to an 

* extremely high arithmetic implementation dependency of the result; 

* s being between x.5 and x.500001 (which is now incorrectly rounded 

* downwards instead of upwards) is assumed to occur less often 

* (if at all) 

*/ 

} 


/* idct.c, inverse fast discrete cosine transform 


/* inverse two dimensional DCT, Chen-Wang algorithm */ 

/* (cf. IEEE ASSP-32, pp 803-816, Aug. 1984) */ 

/* 32-bit integer arithmetic (8 bit coefficients) */ 
f* 1 1 mults, 29 adds per DCT */ 

I* sE, 18.8.91 */ 

/it*****************************************#***************/ 

/* coefficients extended to 1 2 bit for IEEE 1180-1 990 */ 

I* compliance sE, 2.1.94 */ 

ft*********************^^i,*******************t*********m*m*/ 


/* this code assumes » to be a two's-complement arithmetic */ 


f* right shift: (-2)»1 = -1 , (-3)»1 = -2 


*/ 


/* row (horizontal) IDCT 
* 

* 7 pi 1 

* dst[k] = sum c[l] * src[l] * cos( - * ( k + - ) * 1 ) 

* 1=0 8 2 
* 

* where: c[0] = 128 

* c[l. 7] = 128*sqrt(2) 

*/ 

#define W1 2841 /* 2048*sqrt(2)*cos(l*pi/16) */ 
#define W2 2676 /* 2048*sqrt(2)*cos(2*pi/16) V 
#define W3 2408 /* 2048*sqrt(2)*cos(3*pi/16) */ 
#define W5 1609 /* 2048*sqrt(2)*cos(5*pi/16) */ 
#defme W6 1108 /* 2048*sqrt(2)*cos(6*pi/16) */ 
#define W7 565 /* 2048*sqrt(2)*cos(7*pi/16) */ 

/* pnvate data */ 

static short iclip[1024]; /* clipping table */ 
static short *iclp; 

/* private prototypes */ 

static void idctrow _ANSl_ARGS_((short *blk)); 
static void idctcol _ANSI_ARGS_((short ’"blk)); 


static void idctrow(short *blk) 

{ 

int xO, xl, x2, x3, x4, x5, x6, x7, x8; 

/* shortcut */ 

if (!((xl = blk[4]«l 1) | (x2 = blk[6]) | (x3 = blk[2]) | 

(x4 = blk[l]) | (x5 = blk[7]) | (x6 = blk[5]) | (x7 = blk[3]))) 

{ 

blk[0]=blk[l]=blk[2]=blk[3]=blk[4]=blk[5]=blk[6]=blk[7]=blk[0]«3; 

return; 

} 

xO = (blk[0]«l 1) + 128; /* for proper rounding in the fourth stage */ 

I* first stage */ 
x8 = W7*(x4+x5); 
x4 = x8 + (Wl-W7)*x4; 
x5 = x8 - (Wl+W7)*x5; 
x8 = W3*(x6+x7); 
x6 = x8 - (W3-W5)*x6; 
x7 = x8 - (W3+W5)*x7; 


/* second stage */ 
x8 = xO + xl; 
xO -= xl; 


xl = W6*(x3+x2); 
x2 = xl - (W2+W6)*x2, 
x3 = xl + (W2-W6)*x3, 
xl = x4 + x6, 
x4 -= x6; 
x6 = x5 + x7, 
x5 -= x7; 


/♦ third stage */ 
x7 = x8 + x3; 
x8 -= x3; 
x3 = xO + x2; 
xO -= x2 ; 

x2 = (181 *(x4+x5)+ 1 28)»8, 
x4 = (181*(x4-x5)+128)»8; 

/* fourth stage */ 
blk[0] = (x7+xl)»8; 
blk[l] = (x3+x2)»8, 
blk[2] = (x0+x4)»8, 
blk[3] = (x8+x6)»8; 
blk[4] = (x8-x6)»8; 
blk[5] = (x0-x4)»8; 
blk[6] = (x3-x2)»8; 
blk[7] = (x7-xl)»8; 

} 

/* column (vertical) EDCT 

* 

* 7 pi 1 

* dst[8*k] = sum c[l] * src[8*l] * cos( -- * ( k + - ) * 1 ) 

* 1=0 8 2 
* 

* where: c[0] = 1/1024 

* c[1..7] = (l/1024)*sqrt(2) 

*/ 

static void idctcol(short *blk) 

{ 

int xO, xl, x2, x3, x4, x5, x6, x7, x8, 

/* shortcut */ 

if (!((xl = (blk[8*4]«8)) | (x2 = blk[8*6]) | (x3 = blk[8*2]) | 

(x4 = blk[8*l]) | (x5 = blk[8*7]) | (x6 = blk[8*5]) | (x7 = blk[8*3]))) 

{ 

blk[8*0]=blk(8*l]=blk[8*2]=blk[8*3]=blk[8*4]=blk[8*5]=blk[8*6]=blk[8*7] 

iclp[(blk[8*0]+32)»6]; 

return; 

} 

xO = (blk[8*0]«8) + 8192, 

/* first stage */ 

x8 = W7*(x4+x5) + 4; 

x4 = (x8+(Wl-W7)*x4)»3; 


x5 = (x8-(W 1 +W7)*x5)»3 , 
x8 = W3*(x6+x7) + 4; 
x6 = (x8-(W3-W5)*x6)»3; 
x7 = (x8-(W3+W5)*x7)»3; 


/* second stage */ 
x8 = xO + xl; 
xO -= xl; 

xl = W6*(x3+x2) + 4; 
x2 = (xl -(W2+W6)*x2)»3 ; 
x3 = (xl+(W2-W6)*x3)»3; 
xl = x4 + x6; 
x4 -= x6; 
x6 = x5 + x7; 
x5 -= x7; 

I* third stage *1 
x7 = x8 + x3; 
x8 — x3; 
x3 = xO + x2; 
xO -= x2; 

x2 = (181 *(x4+x5)+ 1 28)»8; 
x4 = (181*(x4-x5)+128)»8; 

/* fourth stage */ 
blk[8*0] = iclp[(x7+xl)»14]; 
blk[8*l] = iclp[(x3+x2)» 14]; 
blk[8*2] = iclp[(x0+x4)»14]; 
blk[8*3] = iclp[(x8+x6)» 14]; 
blk[8*4] = iclp[(x8-x6)»14]; 
blk[8*5] = iclp[(x0-x4)»14]; 
blk[8*6] = iclp[(x3 -x2)» 14]; 
blk[8*7] = iclp[(x7-xl)»14], 

} 

/* two dimensional inverse discrete cosine transform */ 
void idct(short *block) 

{ 

int i; 

for (i=0; i<8; i++) 
idctrow(block+8*i); 

for (i=0; i<8; i++) 
idctcol(block+i); 

} 

void init_idct() 

{ 

int i; 

iclp = iclip+512; 
for (i = -512; i<512; i"H-) 
iclp[i] = (i<-256) ? -256 : ((i>255) ? 255 : i); 


} 


/* form prediction for a complete picture (frontend for predictmb) 

* 

* reflf: reference frame for forward prediction 

* refb: reference frame for backward prediction 

* cur: destination (current) frame 

* secondfield: predict second field of a frame 

* mbi: macroblock info 

* 

* Notes: 

* - cf. predict mb 
*/ 

void predict(unsigned char *reff[], unsigned char *refb[], 
unsigned char *cur[3], int secondfield, 
struct mbinfo *mbi ) 

{ 

int i, j, k; 
k = 0; 

/* loop through all macroblocks of the picture */ 
for 0=0, J<height2; j+=16) 
for (i=0; i<width; i+=16) 

{ 

predict_mb(reff, refb, cur, width, ij,pict_type,pict_struct, 
mbi [k] . mbtype.mbi [k] . motion_type, secondfield, 
mbi[k].MV,mbi[k].mv_field_sel,mbi[k].dmvector); 

k++; 

} 

} 

/* form prediction for one macroblock 

* 

* oldref: reference frame for forward prediction 

* newref: reference frame for backward prediction 

* cur: destination (current) frame 

* lx: frame width (identical to global var width') 

* bx,by: picture (field or frame) coordinates of macroblock to be predicted 

* pict_type: I, P or B 

* pict_struct: FRAMEPICTURE, TOPJFIELD, BOTTOM FIELD 

* mb type. MB FORWARD, MB BACKWARD, MB INTRA 

* motion_type: MC_FRAME, MC_F1ELD, MC_16X8, MC_DMV 

* secondfield: predict second field of a frame 

* PMV[2][2][2]: motion vectors (in half pel picture coordinates) 

* mv_field_sel[2][2]: motion vertical field selects (for field predictions) 

* dmvector: differential motion vectors (for dual prime) 

* 

* Notes: 

* - when predicting a P type picture which is the second field of 

* a frame, the same parity reference field is in oldref, while the 


* opposite parity reference field is assumed to be in newref! 

* - intra macroblocks are modelled to have a constant prediction of 128 

* for all pels; this results tn a DC DCT coefficient symmetric to 0 

* - vectors for field prediction in frame pictures are in half pel frame 

* coordinates (vertical component is twice the field value and always 

* even) 

* 

* already covers dual prime (not yet used) 

*/ 

static void predict_mb(unsigned char *oldrefl], unsigned char *newrefl], 
unsigned char *cur[], 
int lx, int bx, int by, 
int pict_type, int pict_struct, 
int mb_type, int motion type, 
int secondfield, 
int PMV[2][2][2], 
int mv_field_sel[2][2] , 
int dmvector[2] ) 

{ 

int addflag, currentfield; 
unsigned char **predframe; 
int DMV[2](2]; 

if (mb_type&MB INTRA) 

{ 

clearblock(cur,bx,by); 

return; 

} 

addflag = 0; /* first prediction is stored, second is added and averaged */ 

if ((mb type & MB_FORWARD) || (pict_type=P_TYPE)) 

{ 

/* forward prediction, including zero MV in P pictures */ 

if (pict_struct=FRAME_PICTURE) 

{ 

/* frame picture */ 

if ((motion_type=MC_FRAME) || <(mb type & MB FORWARD)) 

{ 

/* frame-based prediction in frame picture */ 
pred(oldref,0,cur,0, 

lx, 1 6, 1 6,bx,by,PMV[0] [0] [0],PMV[0] [0] [ 1 ] ,0); 

} 

else if (motion_type=MC_FIELD) 

{ 

/* field-based prediction in frame picture 


* note scaling of the vertical coordinates (by, PMV[][0][1]) 

* from frame to field! 



/* top field prediction */ 
pred(oldref,mv_field_sel[0][0],cur,0, 
lx« 1 , 1 6,8,bx,by» 1 ,PMV[0] [0] [0],PMV[0] [0] [ 1 ]» 1 ,0); 

/* bottom field prediction */ 
pred(oldref,mv_field_sel[ l][0],cur, 1, 
lx« 1 , 16,8,bx,by» 1 ,PM V[ 1 ] [0] [0],PMV[ 1 ] [0] [ 1 ]» 1 ,0), 

} 

else if (motion_type=MC_DMV) 

{ 

/* dual prime prediction */ 

/* calculate derived motion vectors */ 
calc_DMV(DMV,dmvector,PMV[0J[0][0],PMV[0J[0] [ 1 ]» 1 ); 

/* predict top field from top field */ 
pred(oldref,0,cur,0, 

lx« 1 , 16,8,bx,by» 1 ,PMV[0] (0] [0],PMV[0] [0] [ 1 ]» 1 ,0), 

/* predict bottom field from bottom field */ 
pred(oldref,l,cur, 1, 

lx« 1 , 1 6,8,bx,by» 1 ,PMV[0] (0] [0] ,PMV[0] [0] [ I J» 1 ,0); 

/* predict and add to top field from bottom field */ 
pred(oldref,l,cur,0, 

lx« 1 , 1 6,8,bx,by» 1,DMV[0] [0],DMV[0] [ 1 ], 1 ); 

/* predict and add to bottom field from top field */ 
pred(oldref,0,cur,l, 

lx« 1 , 1 6,8,bx,by» 1 ,DMV[ 1 ] [0],DMV[ 1 ][ 1 ] , 1 ); 

} 

else 

{ 

I* invalid motion type in frame picture */ 
if (Iquiet) 

fprintf(stderr, "invalid motion_type\n"), 

} 

} 

else /* TOP_FIELD or BOTTOM FIELD */ 

{ 

/* field picture */ 

currentfield = (pict_struct=BOTTOM_FIELD); 

/* determine which frame to use for prediction */ 
if ((pict_type=P_TYPE) && secondfield 
&& (currentfield ! =mv_field_sel [0] [0] )) 
predframe = newref; /* same frame */ 
else 

predframe = oldref; /* previous frame *1 

if ((motion_type=MC_FIELD) || !(mb_type & MB FORWARD)) 

{ 

/* field-based prediction in field picture */ 


pred(predframe,mv_field_sel[0][0], cur, currentfield, 
lx« 1 , 1 6, 1 6,bx,by,PM V[0] [0] [0],PMV[0] [0] [ 1 ] ,0); 

} 

else if (motion_type=MC_16X8) 

{ 

/* 16x8 motion compensation in field picture */ 

/* upper half */ 

pred(predframe,mv_field_sel[0][0], cur, currentfield, 
lx« 1 , 1 6,8,bx,by,PMV[0] [0] [0],PMV[0] [0] [ 1 ],0); 

/* determine which frame to use for lower half prediction */ 
if ((pict_type=P_TYPE) && secondfield 
&& (currentfield ! =mv_field_sel [ 1 ] [0] )) 
predframe = newref; /* same frame */ 
else 

predframe = oldref; /* previous frame */ 

/* lower half */ 

pred(predframe,mv_field_sel[ 1 ] [0], cur, currentfield, 
lx« 1 , 1 6,8,bx,by+8,PMV[ 1 ] [0] [0] ,PMV[ 1 ] [0] [ 1 ],0); 

} 

else if (motion_type=MC_DMV) 

{ 

/* dual prime prediction */ 

/* determine which frame to use for prediction */ 
if (secondfield) 

predframe = newref; /* same frame */ 
else 

predframe = oldref; /* previous frame */ 

/* calculate derived motion vectors */ 
calc_DMV(DMV,dmvector,PMV[0] [0] [0],PMV[0][0][ 1 ]); 

/* predict from field of same parity */ 
pred(oldref,currentfield,cur, currentfield, 
lx« 1 , 1 6, 1 6,bx,by,PMV[0] [0] [0] ,PMV[0] [0] [ 1 ] ,0); 

/* predict from field of opposite parity */ 
pred(predfirame, ! currentfield,cur, currentfield, 
lx« 1,16,1 6,bx,by,DMV[0] [0],DMV[0] [ 1 ], 1 ); 

} 

else 

{ 

/* invalid motion type in field picture */ 
if ( (quiet) 

fprintf(stderr, "invalid motion_type\n"); 

} 

} 

addflag = 1; I* next prediction (if any) will be averaged with this one */ 


if (mb_type & MB_BACK WARD) 



{ 

/* backward prediction */ 

if (pict_struct=FRAME_PICTURE) 

{ 

/* frame picture */ 

if (motion_type=MC_FRAME) 

{ 

/* frame-based prediction in frame picture */ 
pred(newref,0,cur,0, 

lx, 1 6, 1 6,bx,by,PMV[0] [ 1 ] [0] ,PMV[0] [ 1 ] [ 1 ] .addflag), 

} 

else 

{ 

/* field-based prediction in frame picture 
* 

* note scaling of the vertical coordinates (by, PMV[][1][1]) 

* from frame to field! 

*/ 

/* top field prediction */ 
pred(newref,mv_field_sel [0] [ 1 ],cur,0, 
lx«l,16,8,bx,by»l,PMV[0][l][0],PMV[0][l][]]»l,addflag); 

/* bottom field prediction */ 
pred(newref,mv_field_sel [ 1 ] [ 1 ],cur, 1 , 
lx« 1 , 1 6,8,bx,by» 1 ,PMV[ 1 ] [ 1 ] [0] ,PMV[ 1 ] [ 1 ] [ 1 ]» 1 , addflag); 

} 

} 

else /* TOP_FIELD or BOTTOM FIELD */ 

{ 

/* field picture */ 

currentfield = (pict_struct=BOTTOM_FIELD); 

if (motion_type=MC_FIELD) 

{ 

/* field-based prediction in field picture */ 
pred(newref,mv_field_sel [0] [ 1 ] ,cur, currentfield, 
lx« 1,16, 16,bx,by,PMV[0] [ 1 ] [0] ,PMV[0] [ 1 ] [ 1 ] .addflag); 

} 

else if (motion_type=MC_16X8) 

{ 

/* 16x8 motion compensation in field picture */ 

/* upper half */ 

pred(newref,mv_field_sel [0] [ 1 ] , cur, currentfield, 
lx« 1 , 1 6,8,bx,by,PMV[0] [ 1 ] [0] ,PMV[0] ( 1 ] [ 1 ], addflag); 

/* lower half*/ 

pred(newref,mv_field_sel[ 1 ] [ 1 ], cur, currentfield, 
lx« 1 , 16,8,bx,by+8,PMV[ 1] [ 1 ] [0],PMV[ 1 ] [ 1 ] [ 1 ], addflag); 

} 


else 

{ 

/* invalid motion_type in field picture *1 
if (Iquiet) 

fprintf(stderr, "invalid motion_type\n"); 

} 

} 


/* predict a rectangular block (all three components) 

* 

* src: source frame (Y,U,V) 

* sfield source field select (0: frame or top field, 1: bottom field) 

* dst: destination frame (Y,U,V) 

* dfield: destination field select (0: frame or top field, 1 : bottom field) 

* 

* the following values are in luminance picture (frame or field) dimensions 

* lx: distance of vertically adjacent pels (selects frame or field pred.) 

* w,h: width and height of block (only 16x16 or 16x8 are used) 

* x,y: coordinates of destination block 

* dx,dy: half pel motion vector 

* addfiag: store or add (= average) prediction 
*/ 

static void predesigned char *src[], int sfield, 
unsigned char *dst[], int dfield, 
int lx, int w, int h, 

int x, int y, int dx, int dy, int addfiag ) 

{ 

int cc; 

for (cc=0; cc<3; cc++) 

{ 

if (cc=l) 

{ 

/* scale for color components */ 
if (chroma_format=CHROMA420) 

{ 

I* vertical *1 

h »= 1; y »= 1; dy /= 2; 

} 

if (chroma_format!=CHROMA444) 

{ 

/* horizontal */ 

w »= 1; x »= 1; dx /= 2; 
lx»= 1; 

} 

} 

pred_comp(src[cc]+(sfield?lx» 1 :0),dst[cc]+(dfield?lx» 1 :0), 
lx, w,h,x,y,dx,dy, addfiag); 

} 

} 


/* low level prediction routine 


* src prediction source 

* dst: prediction destination 

* lx: line width (for both src and dst) 

* x,y: destination coordinates 

* dx,dy: half pel motion vector 

* w,h: size of prediction block 

* addflag: store or add prediction 
*/ 

static void pred_comp(unsigned char *src, unsigned char ♦dst, 
int lx, int w, int h, 
int x, int y, 

int dx, int dy, int addflag ) 

{ 

int xint, xh, yint, yh; 
mti.j; 

unsigned char *s, *d; 

/* half pel scaling */ 

xint = dx»l; /* integer part *1 

xh = dx & 1; /* half pel flag */ 

yint = dy»l; 

yh = dy & 1; 

/* origins */ 

s = src + lx*(y+yint) + (x+xint); /* motion vector */ 
d = dst + lx*y + x; 

if (!xh && !yh) 
if (addflag) 
for (j=0; j<h; j++) 

{ 

for (i=0; i<w; i++) 

d[i] = (unsigned int)(d[i]+s[i]+l)»l; 
s+= lx; 
d+= lx; 

} 

else 

for (j=0; j<h; j++) 

{ 

for (i=0; i<w; i++) 
d[i] = s[i); 
s+= lx; 
d+= lx; 

} 

else if (!xh && yh) 
if (addflag) 
for O=0;j<h;j++) 

{ 

for (i=0; i<w, i++) 

d[i] = (d[i] + ((unsigned int)(s[i]+s[i+lx]+l)»l)+l)»i; 
s+= lx; 
d+= lx; 


} 

else 

for (j=0; j<h; j++) 

{ 

for (i=0, i<w; i++) 

d[i] = (unsigned int)(sli]+s[i+lx]+l)»l; 
s+= lx; 
d+= lx; 

} 

else if (xh && !yh) 
if (addflag) 
for O=0;j<h;j++) 

{ 

for (i=0; i<w, i++) 

d[i] = (d[i] + ((unsigned int)(s[i]+s[i+l]+l)»l)+l)»l; 
s+= lx; 
d+= lx; 

} 

else 

for (j=0; j<h; j++) 

{ 

for (i=0; i<w; i++) 

d(i] = (unsigned int)(s[i]+s[i+l]+l)»l; 
s+= lx; 
d+= lx; 

} 

else /* if (xh && yh) */ 
if (addflag) 
for (j=0;j<h;j++) 

{ 

for (i=0; i<w; i++) 

d[i] = (d[i] + ((unsigned int)(s[i]+s[i+l]+s[i+lx]+s[i+lx+l]+2)»2)+l)»l; 
s+= lx; 
d+= lx; 

} 

else 

for (j=0;j<h;j++) 

{ 

for (i=0; i<w; i++) 

d[i] = (unsigned int)(s[i]+s[i+l]+s[i+lx]+s[i+lx+l]+2)»2; 
s+= lx; 
d+= lx; 

} 


/* calculate derived motion vectors (DMV) for dual prime prediction 

* dmvector[2]: differential motion vectors (-1,0,+1) 

* mvx,mvy: motion vector (for same parity) 

* 

* DMV[2][2]- derived motion vectors (for opposite parity) 

* 

* uses global variables pict_struct and topfirst 

* 


* Notes: 

* - all vectors are in field coordinates (even for frame pictures) 

*/ 

static void calc_DMV(mt DMV[][2], int *dmvector, int mvx, int mvy ) 

{ 

if (pict_struct=FRAME_PICTURE) 

{ 

if (topfirst) 

{ 

/* vector for prediction of top field from bottom field */ 

DMV[0][0) = ((mvx +(mvx>0))»l) + dmvector[0]; 

DMV[0][1] = ((mvy +(mvy>0))»l) + dmvector[l] - 1; 

/* vector for prediction of bottom field from top field */ 

DMV[1][0] = ((3 *mvx+(mvx>0))» 1 ) + dmvector[0]; 

DMV[1][1] = ((3*mvy+(mvy>0))»l) + dmvector[l] + 1, 

} 

else 

{ 

/* vector for prediction of top field from bottom field */ 

DMV[0][0] = ((3*mvx+(mvx>0))»l) + dmvector[0]; 

DMV[0][1] = ((3*mvy+(mvy>0))»l) + dmvector[l] - 1; 

/* vector for prediction of bottom field from top field *1 
DMV[1][0] = ((mvx +(mvx>0))»l) + dmvector(0]; 

DMV[1][1] = ((mvy +(mvy>0))»l) + dmvector[l] + 1; 

} 

} 

else 

{ 

/* vector for prediction from field of opposite ‘parity’ */ 

DMV[0][0] = ((mvx+(mvx>0))»l) + dmvector[0]; 

DMV[0](1] = ((mvy+(mvy>0))»l) + dmvectorfl]; 

/* correct for vertical field shift */ 
if (pict_struct=TOP_FIELD) 

DMV[0][1]— ; 
else 

DMV[0][1]++; 

} 


static void clearblock(unsigned char *cur[], int iO, int jO ) 

{ 

int i, j, w, h; 
unsigned char *p; 

p = cur[0] + ((pict_struct=BOTTOM_FIELD) ? width : 0) + iO + width2*j0, 

for (j=0; j<16; j++) 

{ 

for (i=0; i<16; i++) 
p[i] = 128; 


p+= width2; 

} 

w = h = 16; 

if (chroma_format!=CHROMA444) 

{ 

iO»=l; w»=l; 

} 

if (chroma_fomiat=CHROMA420) 

{ 

jO»=l; h»=l ; 

} 

p = cur[l] + ((pict_struct=BOTTOM_FIELD) ? chromwidth : 0) + iO 
+ chrom_width2*j0; 

for (j=0; j<h;j++) 

{ 

for (i=0; i<w; i++) 
p[i] = 128; 
p+= chrom_width2; 

} 

p = cur[2] + ((pict_strucl=BOTTOM_FIELD) ? chrom width : 0) + iO 
+ chrom_width2*j0; 

for (j=0;j<h;j++) 

{ 

for (i=0; i<w; i++) 
p[i] = 128; 
p+= chrom_width2; 

} 

} 


/*********************************/ 


/* 


*/ 


/* putpic.cpp *1 

I* */ 

/*********************************/ 


#include <stdio.h> 

#include <math.h> 

^include "c:\nick\mpeg\org\config.h" 
#include "c:\nick\mpeg\org\global.h" 
^include "c:\nick\mpeg\org\vlc.h" 


extern FILE *outfile; I* the only global var we need here */ 

/* private data */ 
static unsigned char outbfr; 
static int outcnt; 
static int bytecnt; 

/* private prototypes */ 

static void iquantl intra _ANSI_ARGS_((short *src, short *dst, int dc_prec, 
unsigned char *quant_mat, int mquant)); 
static void iquantl_non_intra _ANSI_ARGS_((short *src, short *dst, 

unsigned char *quant_mat, int mquant)); 
static void calc_actj _ANSI_ARGS_((unsigned char *frame)); 
static double var sblk _ANSI_ARGS_((unsigned char *p, int lx)); 

static void putDC _ANSI_ARGS_((sVLCtable *tab, int val)); 

static int frametotc _ANSI_ARGS_((int frame)); 

static void putmvs _ANSI_ARGS_((int MV[2][2][2], 
int PMV[2][2][2], 
int mv_field_sel[2][2], 
int dmvector[2], 
int s, int motion type, 
int hor f code, int vert f code)); 

/* quantization / variable length encoding of a complete picture */ 
void putpict(unsigned char *frame) 

{ 

int i, j, k, comp, cc; 
int mb_type; 
int PMV[2][2][2]; 
int prevmquant; 
int cbp, MBAinc; 

rc_init_pict(frame), /* set up rate control */ 

/* picture header and picture coding extension */ 
putpicthdrQ; 


if (tmpegl) putpictcodext(), 


prevmquant = rc_start_mb(); /* initialize quantization parameter */ 
k = 0; 

for 0=0; j<mb_height2; j++) 

{ 

/* macroblock row loop */ 

for (i=0; i<mb_width; i++) 

{ 

/* macroblock loop */ 
if (i=0) 

{ 

/* slice header (6.2.4) */ 
alignbits(); 

if (mpegl || vertical_size<=2800) 
putbits(SLICE_MIN_START+j,32); I* s!ice_start_code *1 
else 
{ 

putbits(SLICE_MIN_START+(j&127),32); /* slice_start_code */ 
putbits(j»7,3); /* slice_vertical_position_extension */ 

} 

/* quantiser_scale_code */ 

putbits(q_scale_type ? map_non_linear_mquant[prev_mquant] 

: prev mquant » 1, 5); 

putbits(0,l); f* extra bit slice */ 

/* reset predictors */ 

for (cc=0; cc<3; cc++) 
dc_dctjsred[cc] = 0; 

PMV[0] [0] [0]=PMV[0] [0] [ 1]=PMV[ 1 ] [0][0]=PMV[ 1] [0] [ 1 ]=0; 
PMV[0][1][0]=PMV[0][1][1]=PMV[1][1][0]=PMV[1][1][1]=0; 

MBAinc = i + 1; /* first MBAinc denotes absolute position */ 

} 

mb_type = mbinfo[k].mb_type; 

/* determine mquant (rate control) */ 
mbinfo[k].mquant = rc_calc_mquant(k); 

/* quantize macroblock */ 
if (mbtype & MB_INTRA) 

{ 

for (comp=0; comp<block_count, comp++) 

quant_intra(blocks[k*block_count+comp], blocks [k*block_count+comp], 
dc_prec,intra_q,mbinfo[k]. mquant); 
mbinfo[k].cbp = cbp = (l«block_count) * 1; 


} 

else 

{ 

cbp = 0; 

for (comp=0;eomp<block_count;comp-H-) 
cbp = (cbp«l) | quant_non_intra(blocks[k*block_count+comp], 
blocks[k*block_count+comp], 
inter_q,mbinfo[k] . mquant); 


mbinfo[k].cbp = cbp, 
if (cbp) 

mb_type|= MB_P ATTERN ; 

} 

/* output mquant if it has changed */ 
if (cbp && prev_mquant!=mbinfo{k]. mquant) 
mb_type|= MBQUANT; 

/* check if macroblock can be skipped */ 
if (i!=0 && i!=mb_width-l && !cbp) 

{ 

/* no DCT coefficients and neither first nor last macroblock of slice */ 

if (pict type=P_TYPE && !(mb_type&MB_FORWARD)) 

{ 

/* P picture, no motion vectors -> skip */ 

/* reset predictors */ 

for (cc=0; cc<3; cc++) 
dc_dct_pred[cc] = 0; 

PMV[0] (0] [0]=PMV[0] [0] [ 1 ]=PMV[ 1 ] [0] [0]=PMV[ 1 ] [0] [ 1 ]=0; 

PMV[0] [ 1 ] [0]=PMV[0] ( 1 ] [ 1 ]=PMV[ 1 ] [ 1 ] [0]=PMV( 1 ] [ 1 ] [ 1 ]=0; 

mbinfo[k].mb_type = mbtype; 
mbinfofk]. skipped = 1; 

MBAinc-H-; 

k++; 

continue; 

} 

if (pict_type=B_TYPE && pict_struct=FRAME_PICTURE 
&& mbinfo[k].motion_type=MC_FRAME 

&& ((mbinfo[k-l].mb_type A mb_type)&(MB_FORWARD|MB_BACKWARD))==0 
&& (!(mb_type&MB_FORWARD) || 

(PMV[0] [0] [0]=mbinfo[k] . MV[0] [0] [0] && 

PMV[0] [0] [ 1 ]=mbinfo[k].MV[0] [0] [ 1 ])) 

&& (!(mb_type&MB_BACKWARD) || 

(PMV[0] [ 1 ] [0]=mbinfo[k] MV[0][1][0] && 

PMV[0] [ 1 ] [ 1 ]=mbinfo[k] MV[0] [ 1 ] [ 1 ]))) 

{ 

/* conditions for skipping in B frame pictures: 


* - must be frame predicted 

* - must be the same prediction type (forward/backward/interp ) 

* as previous macroblock 

* - relevant vectors (fonvard/backward/both) have to be the same 

* as in previous macroblock 

*/ 

mbinfo[k].mb_type = mbtype; 
mbinfofk] skipped = 1; 

MBAmc-H-; 

k++; 

continue; 

} 

if (pict_type=B_TYPE && pict_struct!=FRAME_PICTURE 
&& mbinfo[k].motion_type=MC_FTELD 

&&((mbinfotk-l].mb_type A mb_type)&(MB_FORWARD|MB_BACKWARD))==0 
&& (!(mb_type&MB_FORWARD) || 

(PMV[0] [0] [0]=mbinfo[k] MV[0] (0] [0] && 

PMV[0] [0] [ 1 ]=mbinfo[k].MV[0] [0] [ 1 ] && 
mbinfo[k].mv_field_sel[0][0]=(pict_struct==BOTTOM_FIELD))) 

&& ( ! (mb_type&MB_B ACK WARD) || 

(PMV[0] [ 1] [0]=mbinfo[k].MV[0] [ 1 ] [0] && 

PMV[0] [ 1 ] [ 1 ]=mbinfo[k] MV[0] [ 1 ] [ 1 ] && 
mbinfo[k].mv_field_sel[0][l]=(pict_struct=BOTTOM_FIELD)))) 

{ 

/* conditions for skipping in B field pictures: 

* - must be field predicted 

* - must be the same prediction type (forwarchbackward/interp. ) 

* as previous macroblock 

* - relevant vectors (fonvard/backward/both) have to be the same 

* as in previous macroblock 

* - relevant motion_vertical_field_selects have to be of same 

* parity as current field 
*/ 

mbinfo[k].mb_type = mbtype; 
mbinfo[k]. skipped = 1; 

MBAinc++; 

k++; 

continue; 

} 

} 

/* macroblock cannot be skipped */ 
mbinfo[k]. skipped = 0; 

/* there's no VLC for 'No MC, Not Coded': 

* we have to transmit (0,0) motion vectors 
V 

if (pict_type=P_TYPE && !cbp && !(mb_type&MB_FORWARD)) 
mb_type|= MBFORWARD, 

putaddrinc(MBAinc); /* macroblock address increment */ 


MBAinc = 1; 


putmbtype(pict_type,mb_type); /* macroblock type */ 

if (mbtype & (MB_FORWARD|MB_BACKWARD) && !frame_pred_dct) 
putbits(mbinfo[k].motion_type,2); 

if (pict_struct=FRAME_PICTURE && cbp && !frame_pred_dct) 
putbits(mbinfo[k] ,dct_type, 1 ); 

if (mb type & MB_QUANT) 

{ 

putbits(q_scale_type ? map_non_linear_mquant[mbinfo[k].mquant] 

: mbinfo[k].mquant»l,5); 
prevmquant = mbinfo[k].mquant; 

} 

if (mb type & MB FORWARD) 

{ 

/* forward motion vectors, update predictors */ 

putmvs(mbinfo[k] .MV,PMV,mbinfo[k] ,mv_field_sel,mbinfo[k) .dmvector.O, 
mbinfo[k].motion_type,forw_hor_f_code,forw_vert_f_code); 

} 

if (mb type & MB_BACKWARD) 

{ 

/* backward motion vectors, update predictors */ 

putmvs(mbinfo[k].MV,PMV,mbinfo[k].mv_field_sel,mbinfo[k].dmvector,l, 

mbinfo[k].motion_type,back_hor_f_code,back_vert_f_code); 

} 

if (mb type & MB PATTERN) 

{ 

putcbp((cbp » (block_count-6)) & 63); 
if (chroma_format!=CHROMA420) 
putbits(cbp,block_count-6); 

} 

for (comp=0; comp<block_count; comp++) 

{ 

/* block loop */ 

if (cbp & (l«(block_count-l-comp))) 

{ 

if (mb type & MB INTRA) 

{ 

cc = (comp<4) ? 0 : (comp&l)+l; 
putintrablk(blocks[k*block_count+comp],cc), 

} 

else 

putnonintrablk(blocks[k*block_count+comp]), 

} 

} 


/* reset predictors */ 


if (!(mb_type & MBINTRA)) 
for (cc=0; cc<3; cc++) 
dc_dct_pred[cc] = 0; 

if (mb_type & MB_INTRA || (pict_t>-pe=P_TYPE && !(mb_type & MBFORWARD))) 

{ 

PMV[0] [0] [0]=PMV[0] [0] [ 1 ]=PMV[ 1 ] [0] [0]=PMV[ 1 ] [0] [ 1 ]=0; 
PMV[0][1][0]=PMV[0][1][1]=PMV[1][1][0]=PMV[1][1][1]=0, 

} 

mbinfo[k].mb_type = mbtype; 
k++; 

} 

} 

rc_update_pict(); 

vbv_end_of_picture(); 

} 


/* output motion vectors (6.2.5.2, 6.3. 16.2) 

* 

* this routine also updates the predictions for motion vectors (PMV) 
*/ 


static void putmvs(int MV[2][2][2], 
intPMV[2][2][2], 
int mv_field_sel[2][2], 
int dmvector[2], 
int s, int motion type, 
int horfcode, int vertfcode ) 

{ 

if (pict_struct=FRAME_PICTURE) 

{ 

if (motion_type=MC_FRAME) 

{ 

/* frame prediction */ 

putmv(MV[0] [s] [0]-PMV[0] [s] [0],hor_f_code); 
putmv(MV[0] [s] [ 1 ]-PMV[0] [s] [ 1 ],vert_f_code); 

PMV[0] [s] [0]=PMV[ 1 ] [s] [0]=MV[0] [s] [0] ; 

PMV[0] [s] [ 1 ]=PMV[ 1 ] [s] [ 1 ]=MV[0] [s] [ 1 ] ; 

} 

else if (motion_type=MC_FIELD) 

{ 

/* field prediction */ 

putbits(mv_field_sel[0][s], 1); 

putmv(MV[0] [s] [0]-PMV[0] [s] [0] .horfcode); 
putmv((MV[0] [s] [ 1 ]» 1 )-(PMV[0] [s] [ 1 ]» l),vert_f_code); 
putbits(mv _field_sel[ 1 ] [s], 1 ); 
putmv(MV[ 1 ] [s] [0] -PMV[ 1 ] [s] [0],hor_f_code); 
putmv((MV[ 1 ] [s] [ 1 ]» 1 )-(PMV[ 1 ] [s] [ 1 ]» 1 ),vert_f_code); 
PMV[0) [s] [0]=MV[0] [s] [0] ; 

PMV[0] [s] [ 1 ]=MV[0] [s] [ 1 ] ; 

PMV[ 1] [s] [0]=MV[ 1 ] [s] [0] ; 


PMVIl][s][l)=MV[l]ls]|l], 

} 

else 

{ 

/* dual prime prediction */ 

putmv(MV[0] [s] [0]-PMV[0] [s] [0],hor_f_code); 

putdmv(dmvector[0]); 

putmv((MV[0] [s] [ 1]» 1 )-(PMV[0] [s] [ 1 ]» 1 ),vert_f_code), 
putdmv(dmvector[ 1 ]); 

PMV[0] [s] [0]=PMV[ 1 ] [s] [0]=MV[0] [s] [0]; 

PMV[0] [s] [ 1 ]=PMV[ 1 ] [s] [ 1 ]=MV[0] [s] [ 1 ]; 

} 

} 

else 

{ 

/* field picture */ 
if (motion_type=MC_FIELD) 

{ 

/* field prediction */ 
putbits(mv_field_sel[0][s], 1); 
putmv(MV[0] [s] [0]-PMV[0] [s] [0] ,hor_f_code), 
putmv(MV[0] [s] [ 1]-PMV[0] [s] [ 1 ],vert_f_code); 

PMV[0] [s] [0]=PMV[ 1 ] [s] [0]=MV(0] [s] [0] ; 

PMV[0] [s] [ 1 ]=PMV[ 1 ] [s] [ 1 ]=MV[0] [s] [ 1 ] ; 

} 

else if (motion_type=MC_16X8) 

{ 

/* 16x8 prediction */ 

putbits(mv_field_sel[0][s], 1); 

putmv(MV[0] [s] [0]-PMV[0] [s] [0] ,hor_f_code); 

putmv(MV[0] [s] [ 1 ] -PMV[0] [s] [ 1 ] ,vert_f_code); 

putbits(mv_field_sel [ 1 ] [s] , 1 ); 

putmv(MV[ 1 ] [s] [0]-PMV[ 1 ] [s] [0] ,hor_f_code); 

putmv(MV[ 1 ] [s] [ 1 ]-PMV[ 1 ] [s] [ 1 ],vert_f_code), 

PMV[0] [s] [0]=MV[0] [s] [0] ; 

PMV[0] [s] [ 1 ]=MV[0] [s] [ 1 ] , 

PMV[ 1 ] [s] [0]=MV[ 1 ] [s] [0] ; 

PMV[ 1 J [s] [ 1 ]=MV[ 1 ] [s] [ 1 ] ; 

} 

else 

{ 

/* dual prime prediction *1 

putmv(MV[0] [s] [0]-PMV(0] [s] [0],hor_f_code), 

putdmv(dmvector[0]); 

putmv(MV[0] [s] [ 1 ]-PMV[0] [s] [ 1 ],vert_f_code); 
putdmv(dmvector[ 1 ]); 

PMV[0]{s][0]=PMV[ 1 ] [s] [0]=MV[0] [s] [0] , 

PMV[0] [s] [ 1 ]=PMV[ 1 ] [s] [ 1 ]=MV[0] [sjflj; 

} 

} 

} 

/* generate variable length codes for an intra-coded block (6.2.6, 6.3. 17) */ 
void putintrablk(short *blk, int cc) 


{ 

int n, dctdiff, run, signed_level; 

/* DC coefficient (7.2.1)*/ 

dct_di£f = blk[0] - dc_dct_pred[cc]; /* difference to previous block */ 
dc_dct_pred[cc] = blk[0], 


if (cc==0) 

putDClum(dct_diff); 

else 

putDCchrom(dct_diff); 

/* AC coefficients (7.2.2) */ 
run = 0; 

for (n=l; n<64; n++) 

{ 

/* use appropriate entropy scanning pattern */ 

signedjevel = blk[(altscan ? altemate_scan : zig_zag_scan)[n]]; 

if (signedjevel !=0) 

{ 

putAC(run,signedJevel,intravlc); 
run = 0; 

} 

else 

run-H-; /* count zero coefficients */ 

} 

/* End of Block -- normative block punctuation */ 
if (intravlc) 

putbits(6,4); /* 01 10 (Table B-15) */ 
else 

putbits(2,2); /* 10 (Table B-14) */ 

} 

/* generate variable length codes for a non-intra-coded block (6.2 6, 6.3. 17) */ 
void putnonintrablk(short *blk) 

{ 

int n, run, signedjevel, first; 

run = 0; 
first = 1; 

for (n=0; n<64; n++) 

{ 

/* use appropriate entropy scanning pattern */ 

signedjevel = blk[(altscan ? altemate_scan : zig_zag_scan)(n]|; 

if (signed jevel !=0) 

{ 

if (first) 

{ 

/* first coefficient in non-intra block */ 
putACfirst(run, signedjevel); 
first = 0; 


} 

else 

putAC(run,signed_level,0); 
run = 0, 

} 

else 

run++; /* count zero coefficients *1 

} 

/* End of Block - normative block punctuation */ 
putbits(2,2); 

} 

/* generate variable length code for a motion vector component (7.6.3. 1) */ 
void putmv(int dmv, int f_code) 

{ 

int r_size, f, vmin, vmax, dv, temp, motion_code, motion_residual, 

r size = f_code - 1; /* number of fixed length code ('residual') bits */ 
f = l«r_size; 

vmin = -16*f; /* lower range limit */ 
vmax = 16*f - 1 ; /* upper range limit */ 
dv = 32*f; 


/* fold vector difference into [vmin.. .vmax] */ 
if (dmv>vmax) 
dmv— dv; 
else if (dmv<vmin) 
dmv+= dv; 

/* check value */ 
if (dmv<vmin || dmv>vmax) 
if (Iquiet) 

fprintfistderr, "invalid moUon vectorVn"); 

/* split dmv into motion code and motion residual */ 
temp = ((dmv<0) ? -dmv : dmv) + f - 1 ; 
motioncode = temp»r_size; 
if (dmv<0) 

motioncode = -motioncode; 
motion_residual = temp & (f-1); 

putmotioncode(motion code); I* variable length code */ 

if (r_size!=0 && motion_code!=0) 
putbits(motion_residual,r_size), /* fixed length code */ 

} 


/* generate sequence header (6.2.2. 1, 6.3.3) 

♦ 

* matrix download not implemented 
*/ 


void putseqhdrO 

{ 

int i; 

alignbitsO, 

putbits(SEQ_START_CODE,32); /* sequence_header_code */ 
putbits(horizontal_size,12); /* horizontal_size_value */ 
putbits(vertical_size,12); /* vertical size value */ 
putbits(aspectratio,4); /* aspect_ratio_information */ 
putbits(frame_rate_code,4); /* frame_rate_code */ 
putbits((int)ceil(bit_rate/400.0),18); /* bit_rate_value */ 
putbits(l,l); /* marker bit */ 

putbits(vbv_buffer_size,10); /* vbv_bufFer_size_value */ 
putbits(constrpanns,l); /* constrained_parameters flag */ 

putbits(load_iquant,l); /* load_intra_quantizer_matrix V 
if (load_iquant) 

for (i=0, i<64; i++) I* matrices are always downloaded in zig-zag order */ 
putbits(intra_q[zig_zag_scan[i]],8); /* intra_quantizer_matrix */ 

putbits(load_niquant,l); /* load_non_intra_quantizer_matrix */ 
if (load_niquant) 
for (i=0; i<64; i++) 

putbits(inter_qlzig_zag_scan[i]],8); /* non_intra_quantizer_matrix */ 

} 

/* generate sequence extension (6.2. 2. 3, 6.3.5) header (MPEG-2 only) */ 
void putseqextO 
{ 

alignbitsO; 

putbits(EXT_START_CODE,32), /* extension_start_code */ 
putbits(SEQ_DD,4); /* extension_start_code_identifier */ 
putbits((profile«4)|level,8); /* profile_and_level_indication */ 
putbits(prog_seq,l); /* progressive sequence */ 
putbits(chroma_format,2); /* chroma_format */ 
putbits(horizontal_size»12,2); /* horizontal size extension */ 
putbits(vertical_size»12,2); /* vertical_size_extension */ 
putbits(((int)ceil(bit_rate/400.0))»18,12); /* bit_rate_extension *1 
putbits(l,l); /* marker_bit */ 

putbits(vbv_bu£fer_size»10,8); /* vbv_buffer_size_extension */ 
putbits(0,l); /* low_delay — currently not implemented */ 
putbits(0,2); /* frame_rate_extension_n */ 
putbits(0,5); /* frame_rate_extension_d */ 

} 

/* generate sequence display extension (6.2. 2.4, 6.3.6) 

* 

* content not yet user setable 
*/ 

void putseqdispextO 

{ 

alignbitsO; 

putbits(EXT_START_CODE,32); f* extension_start_code */ 
putbits(DISP_ID,4); /* extension_start_code_identifier */ 


putbits(video_format,3); /* video_format */ 
putbits(l.l); /* colour_description */ 
putbits(color_primaries,8); /* colourjjrimanes */ 
putbits(transfer_characteristics,8); /* transfer_characteristics */ 
putbits(matrix_coefficients,8); /* matrix_coefficients */ 
putbits(display_horizontal_size,14); /* display_horizontal_size */ 
putbits(l,l); /* maxker_bit */ 

putbits(display_vertical_size,14); /* display_vertical_size */ 

} 

/* output a zero terminated string as user data (6.2.2.2.2, 6.3.4. 1) 

* 

* string must not emulate start codes 

*/ 

void putuserdata(char *userdata) 

{ 

alignbits(); 

putbits(USER_START_CODE,32); /* user_data_start_code */ 
while (*userdata) 
putbits(*userdata++,8), 

} 

/* generate group of pictures header (6.2 2.6, 6.3.9) 

* 

* uses tcO (timecode of first frame) and frameO (number of first frame) 
*1 

void putgophdr(int frame, int closed_gop) 

{ 

int tc; 
alignbitsO; 

putbits(GOP_START_CODE,32); /* group start code */ 

tc = frametotc(tcO+frame); 

putbits(tc,25); /* time_code *1 

putbits(closed _gop, 1); /* closed_gop */ 

putbits(0,l); /* broken link */ 

} 

/* convert frame number to time code 

♦ 

* drop_frame not implemented 

*/ 

static int frametotc(int frame) 

{ 

int fps, pict, sec, minute, hour, tc; 

fps = (int)(ffame_rate+0.5); 
pict = firame%fps; 
frame = (firame-pict)/fps; 
sec = frame%60; 
frame = (frame-sec)/60; 
minute = frame%60; 
frame = (frame-minute)/60; 
hour = frame%24; 


tc = (hour«19) | (minute«13) J (1«12) ( (sec«6) | pict; 


return tc; 

} 

/* generate picture header (6.2.3, 6.3.10) */ 
void putpicthdr() 

{ 

alignbits(); 

putbits(PICTURE_START_CODE,32); /* picture start code */ 
calc_vbv_delay(); 

putbits(temp_ref,10); I* temporal_reference */ 
putbits(pict_type,3); /* picture coding type */ 
putbits(vbv_delay, 16); /* vbv delay */ 

if (pict_type=P_TYPE || pict_type=B_TYPE) 

{ 

putbits(0,l); /* full_pe!_forward_vector */ 
if (mpegl) 

putbits(forw_hor_f_code,3); 

else 

putbits(7,3); /* forward_f_code */ 

} 

if (pict_type==B_TYPE) 

{ 

putbits(O.l); /* fulljjel_backward_vector */ 
if (mpegl) 

putbits(back_hor_f_code,3); 

else 

putbits(7,3); /* backward_f_code */ 

} 

putbits(0,l); /* extra_bit_picture */ 

} 

/* generate picture coding extension (6.2.3. 1, 6.3. 1 1) 

* 

* composite display information (v_axis etc.) not implemented 

*! 

void putpictcodextO 

{ 

alignbitsO; 

putbits(EXT_START_CODE,32); /* extension_start_code */ 
putbits(CODING_ID,4); /* extension_start_code_identifier */ 
putbits(fonv_hor_f_code,4); I* forward_horizontal_f_code *1 
putbits(forw_vert_f_code,4); /* for\vard_vertical_f_code */ 
putbits(back_hor_f_code,4); /* backward_horizontal_f_code */ 
putbits(back_vert_f_code,4); /* backward_vertical_f_code */ 
putbits(dc_prec,2); /* intra_dc_precision *1 
putbits(pict_struct,2); /* picture stmcture */ 

putbits((pict_struct=FRAME_PlCTURE)?topfirst:0,l); /* top_field_first */ 

putbits(frame_pred_dct,l); /* ffame_pred frame_dct */ 

putbits(0,l); /* concealment_motion_veaors - currently not implemented */ 


putbits(q_scale_type,l); /* q_scale_type */ 
putbits(intravlc,l), /* intra_vlc_format */ 
putbits(altscan,l); I* altematescan */ 
putbits(repeatfirst,l), /* repeat_first_field *1 
putbits(prog_firame,l); /* chroma_420_type */ 
putbits(prog_frame,l); I* progressiveframe */ 
putbits(0,l); /* compositedisplayflag */ 

} 

/* generate sequence_end_code (6.2.2) */ 
void putseqendO 
{ 

alignbits(); 

putbits(SEQ_END_CODE,32), 

} 

I* initialize buffer, call once before first putbits or alignbits *1 
void initbits() 

{ 

outcnt = 8; 
bytecnt = 0; 

} 

/* write rightmost n (0<=n<=32) bits of val to outfile */ 
void putbits(int val, int n) 

{ 

int i; 

unsigned int mask; 

mask = 1 « (n-1); /* selects first (leftmost) bit */ 

for (i=0; i<n; i++) 

{ 

outbfr «= 1; 

if (val & mask) 
outbfr|= 1; 

mask »= 1; /* select next bit */ 

outcnt—; 

if (outcnt=0) /* 8 bit buffer full */ 

{ 

putc(outbff, outfile); 
outcnt = 8; 
bytecnt++; 

} 

} 


/* zero bit stuffing to next byte boundary (5.2.3, 6.2 1) */ 
void alignbitsO 
{ 

if (outcnt! =8) 


putbits(0,outcnt); 


/* return total number of generated bits */ 
int bitcountQ 
{ 

return 8*bytecnt + (8-outcnt); 

} 

/* generate variable length code for luminance DC coefficient */ 
void putDClum(int val) 

{ 

putDC(DClumtab,val); 

} 

/* generate variable length code for chrominance DC coefficient ♦/ 
void putDCchrom(int val) 

{ 

putDC(DCchromtab,val); 

> 

/* generate variable length code for DC coefficient (7.2. 1) */ 
static void putDC(sVLCtable *tab, int val ) 

{ 

int absval, size; 

absval = (val<0) ? -val : val; /* abs(val) */ 

if (absval>2047 || (mpegl && absval>255)) 

{ 

/* should never happen */ 

sprintf(errortext,"DC value out of range (%d)\n",val); 
error(errortext); 

} 

/* compute dct_dc_size */ 
size = 0; 

while (absval) 

{ 

absval »= 1; 
size++; 

} 

I* generate VLC for dct dc size (Table B-12 or B-13) */ 
putbits(tab[size].code,tab[size] . len); 

/* append fixed length code (dc_dct_differential) */ 
if (size!=0) 

{ 

if (val>=0) 
absval = val; 
else 

absval = val + (l«size) - 1; /* val + (2 A size) - 1 */ 


putbits(absval,size); 

} 

} 

/* generate variable length code for first coefficient 
* of a non-intra block (7.2. 2. 2) */ 
void putACfirst(int run, int val) 

{ 

if (run=0 && (val=l || val=-l)) /* these are treated differently */ 
putbits(2|(val<0),2); /* generate 'Is' (s=sign), (Table B-14, line 2) */ 
else 

putAC(run,val,0); /* no difference for all others *1 

) 

/* generate variable length code for other DCT coefficients (7.2.2) */ 
void putAC(int run, int signedjevel, int vlcformat ) 

{ 

int level, len; 

VLCtable *ptab; 

level = (signed_level<0) ? -signedjevel : signedjevel; /* abs(signedjevel) V 
/* make sure ran and level are valid */ 

if (run<0 || run>63 || level=0 || level>2047 || (mpegl && level>255)) 

{ 

sprintf(errortext,"AC value out of range (nm=%d, signed Jevel=%d)\n", 
run, signedjevel); 
error(errortext); 

} 

len = 0; 

if (run<2 && level<41) 

{ 

/* vlcformat selects either of Table B-14 / B-15 */ 
if (vlcformat) 

ptab = &dct_code_tabla[run] [level-1]; 
else 

ptab = &dct_code_tabl[run][level-l]; 
len = ptab->len; 

} 

else if (run<32 && level<6) 

{ 

/* vlcformat selects either of Table B-14 / B-15 */ 
if (vlcformat) 

ptab = &dct_code_tab2a[run-2] [level- 1|; 
else 

ptab = &dct_code_tab2[run-2][level-l]; 
len = ptab->len; 

} 


if (len !=0) /* a VLC code exists */ 


{ 

putbits(ptab->code,len); 
putbits(signed_leveI<0,l); /* sign */ 

} 

else 

{ 

/* no VLC for this (run, level) combination: use escape coding (7. 2.2.3) */ 
putbits(ll,6); /* Escape */ 
putbits(run,6); /* 6 bit code for run */ 
if (mpegl) 

{ 

/* ISO/IEC 1 1 172-2 uses a 8 or 16 bit code *1 
if (signed_level>127) 
putbits(0,8); 
if (signed_level<-127) 
putbits( 128,8); 
putbits(signed_level,8); 

} 

else 

{ 

/* ISO/IEC 13818-2 uses a 12 bit code, Table B-16 */ 
putbits(signed_level, 12); 

} 

} 

} 

/* generate variable length code for macroblock_address_increment (6.3.16) ♦/ 
void putaddrinc(int addrinc) 

{ 

while (addrinc>33) 

{ 

putbits(0x08,l 1); /* macroblockescape */ 
addrinc— 33; 

} 

putbits(addrinctab[addrinc- 1 ] .code,addrinctab[addnnc- 1 ] . len); 

} 

/* generate variable length code for macroblock type (6.3. 16.1) */ 
void putmbtype(int pict_type, int mb_type) 

{ 

putbits(mbtypetab[pict_type- 1 ] [mb_type] .code, 
mbtypetab[pict_type-l][mb typej.len); 

> 

/* generate variable length code for motion code (6.3.16.3) */ 
void putmotioncode(int motioncode) 

{ 

int abscode; 

abscode = (motion_code>=0) ? motion code : -motion_code; I* abs(motion_code) */ 
putbits(motionvectab[abscode].code,motionvectab[abscode].len); 
if (motion_code!=0) 

putbits(motion_code<0,l); /* sign, 0=positive, l=negative */ 


} 


/* generate variable length code for dmvector[t] (6 3. 16 3), Table B-l 1 */ 
void putdmv(int dmv) 

{ 

if (dmv=0) 
putbits(0,l); 
else if (dmvX)) 
putbits(2,2); 
else 

putbits(3,2); 

} 

/* generate variable length code for coded_block_pattem (6.3. 16.4) 

* 

* 4:2:2, 4:4:4 not implemented 
*/ 

void putcbp(int cbp) 

{ 

putbits(cbptable[cbp] code,cbptable[cbp].len); 

} 

/* rate control variables */ 
int Xi, Xp, Xb, r, dOi, dOp, dOb; 
double avg_act; 
static int R, T, d; 
static double actsum; 
static int Np, Nb, S, Q; 
static int prev mquant; 

void rc_init_seq() 

{ 

I* reaction parameter (constant) */ 

if (r==0) r = (int)floor(2.0"'bit_rate/ffame_rate + 0.5); 

/* average activity */ 
if (avg_act=0.0) avg act = 400 0; 

/* remaining # of bits in GOP */ 

R = 0; 

/* global complexity measure */ 
if (Xi=0) Xi = (int)floor(160.0*bit_rate/l 15.0 + 0.5); 
if (Xp=0) Xp = (int)floor( 60.0*bit_rate/115.0 + 0.5); 
if (Xb=0) Xb = (int)floor( 42.0*bit_rate/l 15.0 + 0.5), 

/* virtual buffer fullness */ 
if (d0i=0) dOi = (int)floor(10.0*r/31 0 + 0.5); 
if (d0p=0) dOp = (int)floor(10.0*r/31.0 + 0.5), 
if (d0b=0) dOb = (int)floor(1.4*10.0*r/31.0 + 0.5), 


void rc_init_GOP(int np, int nb ) 


{ 

R += (int) floor((l + np + nb) * bit_rate / frame_rate + 0.5); 
Np = fieldpic ? 2*np+l : np; 

Nb = fieldpic ? 2*nb ■ nb; 


/* Note: we need to substitute K for the 1.4 and 1.0 constants -- this can 
be modified to fit image content */ 

/* Step 1: compute target bits for current picture being coded */ 
void rc_init_pict(unsigned char *firame ) 

{ 

double Tmin; 

switch (pict_type) 

{ 

case 1_TYPE: 

T = (int) floor(R/( 1 .0+Np*Xp/(Xi* 1 .0)+Nb*Xb/(Xi* 1.4)) + 0.5); 

d = dOi; 

break; 

case PTYPE: 

T = (int) floor(R/(Np+Nb*1.0*Xb/(1.4*Xp)) + 0.5); 

d = dOp; 

break; 

case BTYPE: 

T = (int) floor(R/(Nb+Np*1.4*Xp/(1.0*Xb)) + 0.5); 

d = dOb; 

break; 

} 

Tmin = (int) floor(bit_rate/(8.0*frame_rate) + 0.5); 

if (T<Tmin) 

T = Tmin; 

S = bitcountO; 

Q = 0; 

calc_actj(frame); 
actsum = 0.0; 


static void calc_actj(unsigned char *frame ) 

{ 

int ij,k; 

unsigned char *p; 
double actj.var; 

k = 0; 

for (j=0; j<height2; j+=16) 
for (i=0; i<width; i+=16) 



{ 

p = frame + ((pict_struct==BOTTOM_FIELD)?width 0) + i + width2*j; 

/* take minimum spatial activity measure of luminance blocks */ 

actj = var_sblk(p,width2); 

var = var_sblk(p+8,width2); 

if (var<actj) actj = var; 

var = var_sblk(p+8*width2,width2); 

if (var<actj) actj = var; 

var = var_sblk(p+8*width2+8,width2); 

if (var<actj) actj = var; 

if (Ifieldpic && !prog_seq) 

{ 

/* field •/ 

var = var_sblk(p,width«l); 

if (var<actj) actj = var; 

var = var_sblk(p+8,width«l); 

if (var<actj) actj = var; 

var = var_sblk(p+width,width«l); 

if (var<actj) actj = var; 

var = var_sblk(p+width+8,width«l); 

if (var<actj) actj = var; 

} 

actj+= 1.0; 


mbinfo[k++].act = actj; 

} 


void rc update_pict() 

{ 

double X; 

S = bitcount() - S; /* total # of bits in picture */ 

R-= S; /* remaining # of bits in GOP */ 

X = (int) floor(S*((0.5*(double)Q)/(mb_width*mb_height2)) + 0.5); 
d+=S-T; 

avgact = actsum/(mb_width*mb_height2); 

switch (pict type) 

{ 

case ITYPE: 

Xi="X; 
dOi = d; 
break; 

case p_TYPE: 

Xp = X, 
dOp = d; 

Np~; 

break; 

case B_TYPE: 


} 


Xb = X; 
dOb = d; 
Nb~; 
break; 

} 


/* compute initial quantization stepsize (at the beginning of picture) */ 
int rc_start_mb() 

{ 

int mquant; 

if (q_scale_type) 

{ 

mquant = (int) floor(2 0*d*31.0/r + 0.5); 

/* clip mquant to legal (linear) range */ 
if (mquantd) mquant = 1; 
if (mquant>112) mquant = 112; 

/* map to legal quantization level */ 

mquant = non_linear_mquant_tabIe[map_non_linear_mquant[mquant]J; 

} 

else 

{ 

mquant = (int) floor(d*31.0/r + 0.5); 
mquant «= 1; 

/* clip mquant to legal (linear) range */ 
if (mquant<2) mquant = 2; 
if (mquant>62) mquant = 62; 

prevmquant = mquant; 

} 

return mquant; 

} 

/* Step 2: measure virtual buffer - estimated buffer discrepancy */ 
int rc_calc_mquant(int j) 

{ 

int mquant; 

double dj, Qj, actj, N actj; 

/* measure virtual buffer discrepancy from uniform distribution model */ 
dj = d + (bitcount()-S) - j*(T/(mb_width*mb_height2)); 

/* scale against dynamic range of mquant and the bits/picture count */ 

Qj = dj*31.0/r; 


actj = mbinfo(j].act; 
actsum+= actj; 


/* compute normalized activity */ 

Nactj = (2.0*actj+avg_act)/(actj+2.0*avg_act); 

if (q_scale type) 

{ 

/* modulate mquant with combined buffer and local activity measures */ 
mquant = (int) floor(2.0*Qj*N_actj + 0.5); 

/* clip mquant to legal (linear) range */ 
if (mquant<l) mquant = 1, 
if (mquant>l 12) mquant = 1 12, 

I* map to legal quantization level *1 

mquant = non_linear_mquant_table[map_non_linear_mquant[mquant]]; 

} 

else 

{ 

/* modulate mquant with combined buffer and local activity measures */ 
mquant = (int) floor(Qj*N_actj + 0.5); 
mquant «= 1; 


/* clip mquant to legal (linear) range */ 
if (mquant<2) mquant = 2; 
if (mquant>62) mquant = 62; 

/* ignore small changes in mquant */ 

if (mquant>=8 && (mquant-prev_mquant)>=-4 && (mquant-prev_mquant)<=4) 
mquant = prevmquant; 

prevmquant = mquant; 

} 

Q+= mquant; /* for calculation of average mquant */ 
return mquant; 

} 

/* compute variance of 8x8 block */ 

static double var_sblk(unsigned char *p, int lx) 

{ 

int i, j; 

unsigned int v, s, s2; 

s = s2 = 0; 

for (j=0; j<8; j++) 

{ 

for (i=0, i<8; i++) 

{ 

v = *p++; 
s+= v; 
s2+= v*v; 

} 

p+= lx - 8; 


} 


return s2/64.0 - (s/64.0)*(s/64.0), 

} 

/* VBV calculations 
* 

* generates warnings if underflow or overflow occurs 
*/ 

/* vbv_end_of_picture 

* 

* - has to be called directly after writing picture_data() 

* - needed for accurate VBV buffer overflow calculation 

* - assumes there is no byte stuffing prior to the next start code 
*/ 

static int bitcnt_EOP; 

void vbv_end_of_picture() 

{ 

bitcnt_EOP = bitcountO; 

bitcnt_EOP = (bitcnt_EOP + 7) & -~7; /* account for bit stuffing */ 

} 

/* calcvbvdelay 

* 

* has to be called directly after writing the picture start code, the 

* reference point for vbv delay 
*1 

void calc_vbv_delay() 

{ 

double picture_delay; 

static double next_ip_delay; /* due to frame reordering delay */ 
static double decoding time; 

/* number of 1/90000 s ticks until next picture is to be decoded */ 
if (pict_type — BTYPE) 

{ 

if (prog_seq) 

{ 

if (irepeatfirst) 

picture_delay = 90000.0/frame_rate, /* 1 frame */ 
else 
{ 

if (Itopfirst) 

picture_delay = 90000. 0 *2 . 0/frame_rate; /* 2 frames */ 
else 

picture_delay = 90000.0*3. 0/frame_rate; /* 3 frames */ 

} 

} 

else 

{ 


/* interlaced */ 
if (fieldpic) 

picturedelay = 90000.0/(2.0*frame_rate); /* 1 field */ 
else 
{ 

if (Irepeatfirst) 

picture delay = 90000.0*2.0/(2.0*frame_rate); /* 2 fids */ 
else 

picture delay = 90000.0*3. 0/(2. 0*frame rate); /* 3 fids */ 

} 

} 

} 

else 

{ 

/* I or P picture */ 
if (fieldpic) 

{ 

if(topfirst=(pict_struct==TOP_FIELD)) 

{ 

/* first field */ 

picture_delay = 90000.0/(2. 0*frame_rate); 

} 

else 

{ 

/* second field */ 

/* take frame reordering delay into account */ 
picturedelay = nextipdelay - 90000.0/(2 0*frame_rate); 

} 

} 

else 

{ 

/* frame picture */ 

/* take frame reordering delay into account*/ 
picture_delay = next_ip_delay; 

} 

if (ifieldpic || topfirsti=(pict_struct— TOP_FIELD)) 

{ 

/* frame picture or second field */ 
if (prog_seq) 

{ 

if (Irepeatfirst) 

next_ip_delay = 90000.0/frame_rate; 
else 
{ 

if (Itopfirst) 

next_ip_delay = 90000.0*2. 0/frame_rate; 
else 

nextipdelay = 90000.0*3. 0/frame_rate; 

} 

} 

else 

{ 

if (fieldpic) 


next_ip_delay = 90000.0/(2.0*frame_rate); 
else 
{ 

if (! repea tfirst) 

next_ip_delay = 90000.0*2. 0/(2. 0*frame_rate); 
else 

nextipdelay = 90000. 0*3. 0/(2. 0*frame_rate); 

} 

} 

} 


if (decoding_time==0.0) 

{ 

/* first call of calc_vbv_delay */ 

I* we start with a 7/8 filled VBV buffer (12.5% back-off) */ 
picture_delay = ((vbv_buffer_size*16384*7)/8)*90000.0/bit_rate; 
if (fieldpic) 

next_ip_delay = (int)(90000.0/frame_rate+0.5); 

) 

/* VBV checks */ 

/* check for underflow (previous picture) */ 

if (llowdelay && (decoding_time < bitcnt_EOP*90000.0/bit_rate)) 

{ 

/* picture not completely in buffer at intended decoding time */ 
if (Iquiet) 

fprintf(stderr,"vbv_delay underflow! (decoding_time=%. If, t_EOP=%. lfVn)", 
decodingtime, bitcnt_EOP*90000.0/bit_rate); 


/* when to decode current frame */ 
decodingtime += picture_delay; 

/* warning: bitcount() may overflow (e.g. after 9 min. at 8 Mbit/s */ 
vbv_delay = ( unsigned int)(decoding_time - bitcount()*90000,0/bit_rate); 

/* check for overflow (current picture) */ 
if ((decoding_time - bitcnt_EOP* 90000.0/bitrate) 

> (vbv_buffer_size*16384)*90000.0/bit rate) 

{ 

if (Iquiet) 

fprintf(stderr,"vbv_delay overflow 1 ^"); 

} 

if (vbv_delay<0) 

{ 

if (Iquiet) 

fprintf(stden,"vbv_delay underflow. %d\n n ,vbv_delay); 
vbvdelay = 0; 

} 


if (vbv_delay>65535) 


{ 

if (Iquiet) 

fprintf(stderr,"vbv_delay overflow: %d\n",vbv_delay); 
vbv_delay = 65535, 

} 

} 

/* Test Model 5 quantization 
* 

* this quantizer has a bias of 1/8 stepsize towards zero 

* (except for the DC coefficient) 

*/ 

int quant_intra(short *src, short *dst, int dc_prec, 
unsigned char *quant_mat, int mquant ) 

{ 

int i; 

int x, y, d; 
x = src[0]; 

d = 8»dc_prec; /* intra_dc_mult */ 

dst[0] = (x>=0) ? (x+(d»l))/d : -((-x+(d»l))/d); /* round(x/d) */ 


for (i=l; i<64; i++) 

{ 

x = src[i]; 
d = quant_mat[i); 

y = (32*(x>=0 ? x : -x) + (d»l))/d; /* round( 3 2 * x/quant_mat) */ 
d = (3*mquant+2)»2; 

y = (y+d)/(2*mquant); /* (y+0.75*mquant) / (2*mquant) */ 

/* clip to syntax limits */ 
if (y > 255) 

{ 

if (mpegl) 
y = 255; 

else if (y > 2047) 
y = 2047; 

} 


dst[i] = (x>=0) ? y : -y; 

#if 0 

/* this quantizer is virtually identical to the above *1 
if (x<0) 
x = -x; 

d = mquant*quant_mat[i]; 
y = (16*x + ((3*d)»3))/d; 
dst[i] = (src[i]<0) ? -y : y; 

#endif 

} 

return 1; 

} 


int quant_non_intra(short *src, short *dst. 

unsigned char *quant_mat, int mquant ) 

{ 

int i; 

int x, y, d; 
int nzflag; 


nzflag = 0; 


for (i=0; i<64; i++) 

{ 

x = src[i]; 
d = quant_mat[i]; 

y = (32*(x>=0 ? x : -x) + (d»l))/d; /* round(32*x/quant_mat) */ 
y /= (2*mquant); 

/* clip to syntax limits */ 
if (y > 255) 

{ 

if (mpegl) 
y = 255; 

else if (y > 2047) 
y = 2047; 

} 

if ((dst[i] = (x>=0 ? y : -y)) != 0) 
nzflag=l; 

} 

return n 2 flag; 

} 

/* MPEG-2 inverse quantization */ 
void iquant_intra(short *src, short *dst, int dc_prec, 
unsigned char *quant_mat, int mquant ) 

{ 

int i, val, sum; 
if (mpegl) 

iquantl_intra(src,dst,dc_prec,quant_mat,mquant); 

else 

{ 

sum = dst[0] = src[0] « (3-dc_prec), 
for (i=l; i<64; i++) 

{ 

val = (int)(src[i]*quant_mat[i]*mquant)/16; 

sum+= dst[i] = (val>2047) ? 2047 : ((val<-2048) ? -2048 : val); 

} 

/* mismatch control */ 
if ((sum&l)=0) 
dst[63] A = 1; 

} 

} 



void iquant_non_mtra(short *src, short *dst, 

unsigned char *quant_mat, int mquant ) 

{ 

int i, val, sum; 
if (mpeg 1) 

iquant 1 _non_intra(src,dst,quant_mat, mquant); 
else 
{ 

sum = 0; 

for (i=0; i<64; i++) 

{ 

val = src[t]; 
if (val*=0) 

val = (int)((2*val+(val>0 ? 1 : -l))*quant_mat[i]*mquant)/32; 
sum+= dst[i] = (val>2047) ? 2047 : ((val<-2048) ? -2048 • val); 

} 

/* mismatch control */ 
if ((sum& 1 ) 0) 

dst[63] A = 1, 

} 

} 

/* MPHG-1 inverse quantization */ 
static void iquant l_intra(short *src, short *dst, int dc_prec, 
unsigned char ‘quant mat, int mquant ) 

{ 

int i, val; 

dst[0] = src[0] « (3-dc_prec); 
for (i=l; i<64; i++) 

{ 

val = (int)(src[i]*quant_mat[i]*mquant)/16; 

/* mismatch control */ 
if ((val&l)=0 && val*=0) 
val+= (val>0) ? -1 : 1; 

/* saturation */ 

dst[i] = (val>2047) ? 2047 • ((val<-2048) ? -2048 : val), 

} 

} 

static void iquant l_non_intra(short *src, short *dst, 

unsigned char *quant_mat, int mquant ) 

{ 

int i, val; 

for (i=0; i<64; i++) 

{ 

val = src[i); 
if (val!=0) 


{ 

val = (int)((2*val+(val>0 ? I : -l))*quant_mat[i]*mquant)/32; 

/* mismatch control */ 
if ((val&l)=0 && val!=0) 
val+= (val>0) ? -1 : 1; 

} 

/* saturation */ 

dst[i] = (val>2047) ? 2047 : ((val<-2048) ? -2048 : val), 

} 

} 



/********************************/ 
/* */ 

/* mO.c residing at Root */ 

/* */ 

/********************************/ 


#include 

#include 

#include 

#include 

#mclude 

#include 

^include 

^include 

^include 

#include 

#include 


"c:\tc2v2\dos.h" 

"c:\tc2v2\float.h" 

"c:\tc2v2\math.h" 

"c:\tc2v2\ctype.h" 

"c:\tc2v2\stdio.h" 

"c:\tc2v2\stdlib.h" 

"c:\tc2v2\string.h" 

"c:\tc2v2\chan.h" 

"c:\tc2v2\thread.h" 

"c:\tc2v2\time.h" 

"c:\tc2v2\ermo.h" 


#define GLOBAL /* used by global, h */ 

^include "c:\nick\h\constl.h" 

#include "c:\nick\h\config.h" 
include "c:\nick\h\global.h M 

static void init _ANSI_ARGS_((void)); 

static void readparmfile _ANSI_ARGS_((char *fname)); 

char *neworg[l]; 

void main(argc,argv,envp ) ins,in_ports,outs,out_ports) 
int argc, ins, outs; 
char *argv[],*envp[]; 

CHAN *in_ports[],*out_ports[]; 

{ 

if (argc!=2) 

{ 

printff'Usage: mpeg2encode in.par\n"); 
exit(O); 

} 

/* read parameter file */ 
readparmfile(argv[ 1 ]), 

initO; 

putseqO; 


void error(char *text) 

{ 

fprintf(stderr,text); 

putcCVn'.stderr); 

exit(l); 

} 


static void readparmfile(char *fname) 

{ 

int i; 

int h,m,s,f; 

FILE *fd; 
char line[256]; 

int r,Xi,Xb,Xp,dOi,dOp,dOb; /* rate control */ 
double avg act; /* rate control */ 

sprintf(line,"c:\\nick\\par\\%s",fnajne); 
if (!(fd = fopen(line,"r"))) 

{ 

sprintf(enortext, "Couldn't open parameter file %s",line); 
error(errortext); 

} 

fgets(id_string,254,fd); 
fgets(line,254,fd); sscanf(line,"%s",tplorg); 
fgets(line,254,fd); sscanf(line,"%s",tplref); 
fgets(line,254,fd); sscanf(line,"%s",iqname); 
fgets(line,254,fd); sscanf(line,"%s",niqname); 
fgets(line,254,fd); sscanf(line,"%s",statname), 
fgets(line,254,fd); sscanf(line,"%d",&inputtype); 
fgets(line,254,fd); sscanf(line s "%d",&nframes); 
fgets(line,254,fd); sscanf(line, "%d",&frameO); 
fgets(line,254,fd); sscanf(line ) "%d %d:%d:%d",&h,&m,&s ) &0; 
fgets(line,254,fd); sscanf(line J "%d",&N); 
fgets(line,254,fd); sscanf(line,"%d",&M); 
fgets(line,254,fd); sscanf(line,"%d",&mpegl); 
fgets(line,254,fd); sscanf(line,"%d",&fieldpic); 
fgets(line,254,fd); sscanf(hne,"%d",&horizontal_size); 
fgets(line,254,fd); sscanf(line,"%d",&vertical_size); 
fgets(line,254,fd); sscanf(line, , '%d",&aspectratio); 
fgets(line,254,fd); sscanf(line > "%d",&frame_rate_code); 
fgets(line,254 ) fd), sscanf(hne > "%lf',&bit_rate); 
fgets(line,254,fd); sscanf(line > "%d",&vbv_buffer_size); 
fgets(line,254,fd); sscanf(line,"%d",&low_delay); 
fgets(line,254,fd); sscanf(line > "%d",&constrparms); 
fgets(]ine,254,fd); sscanf(line,"%d",&profile); 
fgets(line,254,fd); sscanf(line,"%d",&level); 
fgets(line,254,fd); sscanf(line,"%d”,&prog_seq); 
fgets(line,254,fd); sscanf(line > "%d",&chroma_format); 
fgets(line,254,fd); sscanf(line,"%d",&video_format); 
fgets(line,254,fd); sscanf(line,"%d" > &color_primaries); 
fgets(line,254,fd); sscanf(line,"%d",&transfer_characteristics); 
fgets(line,254,fd); sscanf(lme, "%d",&matrix_coefficients); 
fgets(line,254 > fd); sscanf(line,"%d",&display_horizontal_size); 
fgets(line,254,fd); sscanf(line,"%d",&display_vertical_size); 
fgets(line,254,fd); sscanf(line,"%d",&dc_prec); 
fgets(line,254,fd); sscanf(line,"%d",&topfirst); 
fgets(line,254 > fd); sscanf(line,"%d %d %d", 
frame_pred_dct_tab,frame_pred_dct_tab+ 1 ,frame_pred_dct_tab+2); 



fgets(line,254,fd); sscanf(line,"%d %d %d", 
conceal_tab,conceal_tab+ 1 ,conceal_tab+2); 


fgets(line,254,fd); sscanf(line,"%d %d %d", 
qscale_tab,qscale_tab+ 1 ,qscale_tab+2); 

fgets(line,254,fd); sscanf(line,"%d %d %d", 
intravlc_tab,intravlc_tab+ 1 ,intravlc_tab+2); 
fgets(line,254,fd); sscanf(line,"%d %d %d", 
altscan_tab,altscan_tab+ 1 ,altscan_tab+2); 
fgets(line,254,fd); sscanf(line,"%d",&repeatfirst); 
fgets(line,254,fd); sscanf(line,"%d",&prog_frame); 

/* intra slice interval refresh period */ 
fgets(line,254,fd); sscanf(line,”%d",&P); 
fgets(line,254,fd); sscanf(hne,"%d",&r); 
fgets(line,254,fd); sscanf(line,"%lf',&avg_act); 
fgets(line,254,fd); sscanf(Iine,"%d",&Xi); 
fgets(line,254,fd); sscanf(line,"%d",&Xp); 
fgets(line,254,fd); sscanf(line,"%d",&Xb); 
fgets(line,254,fd); sscanf(line,"%d",&dOi); 
fgets(line,254,fd); sscanf(line,"%d",&dOp); 
fgets(line,254,fd); sscanf(line,”%d",&dOb), 

if (N<1) error("N must be positive"); 

if (M<1) error("M must be positive"), 

if (N%M != 0) error("N must be an integer multiple of M"); 

motiondata = (struct motion_data *)malloc(M*sizeof( struct motiondata)); 
if (Imotiondata) error("malloc failed motion_data\n"); 

for (i=0; i<M; i++) 

{ 

fgets(line,254,fd); 
sscanf(line,"%d %d %d %d", 

&motion_data[i] forvv_hor_f_code, &motion_data[i].forw_vert_f_code, 
&motion_data[i].sxf, &motion_data[i].syf); 

if (i!=0) 

{ 

fgets0ine,254,fd); 
sscanf(line,"%d %d %d %d", 

&motion_data[i].back_hor_f_code, &motion_data[i].back_vert_f_code, 
&motion_data[i].sxb, &motion_data[i].syb); 

} 

} 

fclose(fd); 

/* make flags boolean (x!=0 -> x=l) */ 
fleldpic = Hfieldpic; 
prog_seq = ! Iprogseq; 
topfirst = ! Itopfirst; 


for (i=0, i<3; i++) 


{ 

frame _pred_dct_tab[i] = !!frame_pred_dct_tab[i]; 
conceal_tab[i] = !!conceal_tab[i], 
qscaie_tab[i] = !!qscale_tab[i]; 
intravlc_tab[i] = !!intravlc_tab[i]; 
altscan_tab[i] = !!altscan_tab[i]; 

} 

repeatfirst = Hrepeatfirst; 
prog_frame = ! Iprogffame; 

chan_out_word(M,Link 1 Output); 
chan_out_word(M,Link20utput); 

chan_out_word(nframes,Link lOutput); 
chan_out_word(nframes,Link20utput); 


static void initQ 

{ 

int i, size, a; 

/* round picture dimensions to nearest multiple of 16 or 32 */ 
mbwidth = (horizontal_size+15)/16; 

mb_height = prog seq ? (vertical_size+15)/16 ; 2*((vertical_size+3 1)/32); 
mb_height2 = fieldpic ? mb_height»l : mb_height; /* for field pictures V 
width = 16*mb_width; 
height = 16*mb_height; 

height2 = fieldpic ? height»l : height; 
width2 = fieldpic ? width«l : width; 

size = width*height ; 

if (!(newrefframe[0] = (char *)malloc(size))) 
error("malloc failed for newrefframe\n"); 
if (!(oldrefframe[0] = (char *)malloc(size)» 
error("malloc failed oldrefframeVn"); 
if (!(neworgframe[0] = (char *)malloc(size))) 
error("malloc failed for neworgframe\n"); 
if (!(oldorgftame[0] = (char *)malloc(size))) 
error("malloc failed\n"); 
if (!(neworg[0] = (char *)malloc(size))) 
error("malloc failed for auxorgframe\n”); 

mbinfo = (struct mbinfo *)malloc(mb_width*mb_height2*sizeof(struct mbinfo)); 
if (Imbinfo) error("malloc failed mbinfo\n”); 

for (a=0; a<mb_width*mb_height2; a ++) 

{ 

mbinfo[a].mb_type = mbinfo[a].motion_type = 0; 

mbinfo[a].MV[0][0][0] = mbinfo[a].MV[0][0][l] = mbinfo[a].MV[0][l][0] = 0; 
mbinfo[a].MV[0][l][l] = mbinfo[a].MV(l][0][0] = mbmfo[a].MV[l][0][l] = 0; 


mbinfo[a] . M V[ 1 ] [ 1 ] [0] = mbinfo[a].MV[l][l][l] = 0; 
mbinfo[a].mv_field_sel[0][0] = mbinfo[a].mv_field_sel[0][l] = 0, 
mbinfo[a].mv_field_sel[l][0] = mbinfo[a] mv_field_sel[l][l] =0, 

} 

chan_out_word(mb_width,Link 1 Output); 
chan_out_word(mb_width,Link20utput); 

chan_out_word(mb_height,LinklOutput); 

chan_out_word(mb_height,Link20utput); 

chan_out_word(mb_height2,LinklOutput); 

chan_out_word(mb_height2,Link20utput); 

chan_out_word(width,Link 1 Output); 
chan_out_word(width,Link20utput); 

chan_out_word(height,Link 1 Output); 
chan_out_word(height,Link20utput), 

chan_out_word(width2,Link 1 Output); 
chan_out_word(width2,Link20utput); 

chan_out_word(height2,Link 1 Output); 
chan_out_word(height2,Link20utput); 

} 


void putseqO 

{ 

/* this routine assumes (N % M) = 0 */ 
int i, j, k, f, fD, n, np, nb; 
int a; 

int sxf, syf, sxb, syb, 

FILE *f_mbi, *f_d, ♦f status; 
char name[128], namel[128J; 
int finished, size; 

size = width*height; 

/* loop through all frames in encoding/decoding order */ 
for (i=0; i<nframes; i++) 

{ 

chan_out_word(i,Link lOutput); 
chan_out_word(i,Link20utput), 

/* read in data from the first program *1 
sprintf(name,tplorg,i); 

sprintf(name 1 , "c:\\nick\\dmot\\%s. sta",name); 
if (!(f_status = fopen(namel,"r"))) 

{ 

sprintffeiTortexV'Couldn't open %s\n",namel); 


error(errortext); 

} 

fscanf(f_status,"%d %d %d %d %d %d %d %d ", 
&pict_type, 

&pict_struct, 

&sxf, 

&sxb, &syb, 

&f, 

&frame_pred_dct); 

fclose(f_statns); 

fprintf(f_time,"frame:%d\n", 0; 

sprintf(name,tplorg,f+frame0); 

sprintf(name 1 ,"c:\\nick\\dmot\\%s.oof ',name); 
if (!(f_d = fopen(namel,"rb"))) 

{ 

sprintf(errortext, "Couldn't open %s\n",namel); 
error(errortext); 

} 

fread(oldorgframe[0], 1 ,size,f_d); 
fclose(f_d); 

sprintf(name 1 , "c:\\nick\\dmot\\%s. nof '.name); 
if (!(f_d = fopen(namel,"rb"))) 

{ 

sprintf(errortext,"Couldn't open %s\n",namel); 
error(errortext); 

} 

fread(neworgframe[0], l,size,f_d); 
fclose(f_d), 

sprintf( name 1 ,"c:\\nick\\dmot\\%s.off", name); 
if (!(f_d = fopen(namel,"rb"))) 

{ 

sprintf(errortext,"Couldn't open %s\n",namel); 
error(errortext); 

} 

firead(oldrefframe[0], l,size,f_d); 
fclose(f_d); 

sprintf(namel,"c:\\nick\\dmot\\%s.nff",name); 
if (!(f_d = fopen(namel,"rb"))) 

{ 

sprintf(errortext, "Couldn't open %s\n",namel); 
error(errortext); 

} 

fread(newrefframe[0] , 1 ,size,f_d); 
fclose(f_d); 


sprintf(namel,"c:\\nick\\dmot\\%s.nor",name); 



if (!(f_d = fopen(namel,"rb"))) 

{ 

sprintf(errortext, "Couldn't open %s\n",namel); 
error(errortext); 

} 

ffead(neworg[0], 1 ,size,f_d); 
fclose(f_d); 

chan_out_word(pict_type,Link 1 Output); 
chan_out_word(picMype,Link20utput); 

chan_out_word(sxf,Link lOutput); 
chan_out_word(sxf,Link20utput); 

chan_out_word(syf,Link 1 Output); 
chan_out_word(syf,Link20utput); 

chan_out_word(sxb,Link 1 Output); 
chan_out_word(sxb,Link2 Output); 

chan_out_word(syb,LinklOutput); 

chan_out_word(syb,Link20utput); 

if ( pict_type != I TYPE ) 

{ 

chan_out_message(size,neworg[0], Link lOutput); 
chan_out_message(size,neworg[0],Link2Output); 

chan_out_message(size,oldorgframe[0], Link lOutput); 
chan_out_message(size,oldorgframe[0],Link2Output); 

chan_out_message(size,oldrefTrame[0], Link lOutput), 
chan_out_message(size,oldrefframe[0],Link2Output); 

if ( pict type != PTYPE ) 

{ 

chan_out_message(size,neworgframe[0],LinklOutput); 

chan_out_message(size,neworgframe[0],Link2Output); 

chan_out_message(size,newrefframe[0],LinklOutput); 

chan_out_message(size,newrefframe[0],Link2Output); 

} 

} 

/* loop through 36 macroblocks of the picture */ 

motion_estimation(oldorgframe[0], neworgframefO], 
oldrefffame[0], newrefframe[0], 
neworgfO], 
sxf, syf, 
sxb, syb, 
mbinfo); 


for (a=36; a<146; a++) 


{ 

chan_in_word(&mbinfo[a] . mb_type,Link 1 Input); 
chan_in_word(&mbinfo[a]. motiontype, Link 1 Input); 
chan_in_word(&mbinfo[a] MV[0] [0] [0],Link 1 Input); 
chan_in_word(&mbinfo[a] .MV[0] [0] [ 1 ],Link 1 Input); 
chan_in_word(&mbinfo[a] .MV[0] [ 1 ] [0], Link 1 Input); 
chan in word(&mbinfo[a].MV[0][ 1 ][ 1 J.Link 1 Input), 
chan_in_word(&mbinfo[a] .MV[ 1] [0] [0], Link 1 Input); 
chan_in_word(&mbinfo[a] .MV[ 1 ] [0] [ 1 ],Link 1 Input); 
chan_in_word(&mbinfo[a J ,MV[ 1 ] [ 1 ] [0],Link 1 Input); 
chan_in_word(&mbinfo[a] MV[ 1 ] [ 1] [ 1 ],Linkl Input); 
chan_in_word(&mbinfo[a] . mv_field_sel[0] [0] ,Link 1 Input); 
chan_in_word(&mbinfo(a] . mv_field_sel [0] [ 1 ] ,Link 1 Input); 
chan_in_word(&mbinfola].mv_field_sel[l][0],LinklInput); 
chan_in_word(&mbinfo[a].mv_field_sel[l][l],LinklInput); 

} 

for (a=146, a<256; a++) 

{ 

chan in \vord(&mbinfo[a].mb_type,Link2Input); 
chan_in_word(&mbinfo[a].motion_type,Link2Input); 
chan_in_word(&mbinfo[a].MV[0][0][0],Link2Input); 
chan_in_word(&mbinfo[a].MV[0][0]fl],Link2Input); 
chan_in_word(&mbinfo[a] .MV[0] [ 1 ] [0],Link2Input); 
chan in word(&mbinfo[a] .MV[0] [ 1 ] [ 1 ],Link2Input); 
chan_in_word(&mbinfo[a] .MV[ 1 ] [0] [0] ,Link2Input); 
chan_in_word(&mbinfo[a] MV( 1 ] [0] [ 1 ],Link2Input); 
chan_in_word(&mbinfo[a] .MV[ 1 ] [ 1 ] [0],Link2Input); 
chan_in_word(&mbinfo[a] .MV[ 1 ] [ 1 ) [ 1 ) ,Link2Input); 
chan_in_word(&mbinfo[a].mv_field_sel[0][0],Link2Input); 
chan_in_word(&mbinfo[a] . mv_field_sel [0] [ 1 ],Link2Input); 
chan_in_word(&mbinfo[a].mv_field_sel[l][0],Link2Input); 
chan_in_word(&mbinfo[a] . mv field sel [ 1 ] [ 1 ],Link2Input); 

} 

/* write data for the third program */ 

sprintf(namel,"c\\mck\\mot_out\\%s.mbi",name); 
if (!(f_ mbi = fopen(name 1 , "h/0"))) 

{ 

sprintf(errortext,"Couldn't open %s\n",namel), 
error(errortext); 

} 

for (a=0; a<mb_width*mb_height2; a++) 

{ 

fprintf(f_mbi, "%d %d %d %d %d %d %d %d %d %d %d %d %d %d \n", 

mbinfo[a] . mbtype, 

mbinfo[a] . motiontype, 

mbinfo[a].MV[0][0][0], 

mbinfo[a].MV[0][0][l], 

mbinfo[a].MV[0][l][0], 

mbinfo(a].MV[0][l][l], 

mbinfo[a] .MV[ 1 ] [0] [0] , 


mbinfo[a].MV[l][0][l], 
mbinfo[a].MV[l][l][0], 
mbinfo[a].MV[l][l][l], 
mbinfo[a] . mv_field_sel [0] [0] , 
mbinfo[a] . mv_field_sel [0] [ 1 ] , 
mbinfo[a] . mv_field_sel [ 1 ] [0] , 
mbinfo[a] . mv_field_sel [ 1 ] [ 1 ] ); 
} 

fclose(f_mbi); 


chan_out_word(i,Link 1 Output); 
chan_out_word(i,Link20utput); 

chan_in_word(&finished,Link 1 Input); 
chan_in_vvord(&finished,Link2Input); 

}/*end of mainQ*/ 


static void frame estimate _ANSl_ARGS_((char *org, 
char *ref, 
char *mb, 
int i, int j, 
int sx, int sy, 
int *iminp, int *jminp, 
int *imintp, int *jmintp, 
int *iminbp, int *jminbp, 
float *dframep. float *dfieldp, 
int *tselp, int *bselp, 
int imins[2][2], int jmins[2][2])); 

static float fullsearch _ANSI_ARGS_((char *org, 
char *ref, 
char *blk, 
int lx, 
int iO, 
int jO, 
int sx, 
intsy, 
char h, 
int xmax, 
int ymax, 
int *iminp, 
int *jminp)); 

static float distl _ANSI_ARGS_((char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h. 


float distlim)); 


static float dist2 _ANSI_ARGS_((char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h)), 

static float bdistl _ANSI_ARGS_((char *pf, 
char *pb, 
char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
mt hyb, 
char h)); 

static float bdist2 _ANSI_ARGS_((char *pf, 
char *pb, 
char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
int hyb, 
char h)); 

static float variance _ANSI_ARGS_((char *p, int lx)); 


void motion_estimation(char *oldorg, char *neworg, 
char *oldref, char *newref, 
char *cur, 
int sxf, 
int syf, 
int sxb, 
int syb, 

struct mbinfo *mbi) 

{ 

int i, j; 

int inun jmin,iminf jminf,iniinr jminr, 
int imint jmint,iminbjminb; 
int imintfjmintf,iminbf jminbf; 
int imintrjntintr,iminbrjminbr; 
float var,vO; 

float dmc,drncf,dmcr,dmci,vmc,vmcf,vmcr,vmci; 
float dmcfield,dmcfieldf,dmcfieldr,dmcfieldi; 
int tsel,bsel,tself,bself,tselr,bselr; 
char ♦mb; 

int imins[21(2] jmins[2][2]; 
int a, ii,ij; 


/* loop through 36 macroblocks of the picture */ 


for(a=0; a<36; a++) 

{ 

jj = a»4; 
ii = a - (jj«4); 
j =ij«4; 
i = ii«4; 


mb = cur + i + width*j; 


var = variance(mb, width); 

/* for I_type pictures */ 

if (pict_type=I_TYPE) 
mbi->mb_type = MB_INTRA; 

/* for P type pictures */ 

if (pict_type=P_TYPE) 

{ 

frame_estimate(oldorg,oldref,mb,ij,sxf,syf, 

&imin,&jmm,&imint,&jmint,&iminb,&jminb, 

&dmc,&dmcfield,&tsel,&bsel,iminsjmins); 

/* select between frame and field prediction */ 
if (dmc<=dmcfield) 

{ 

mbi->motion_type = MCFRAME; 
vmc = dist2(oldref+(imin»l)+width*(jmin»l),mb, 
width, imin& ljmin&l , 16); 

} 

else 

{ 

mbi->motion_type = MC_FIELD; 
dmc = dmcfield; 

vmc = dist2(oldref+(tsel?width:0)+(imint»l)+(width«l)*(jmint»l), 
mb, width« 1 ,imint& 1 Jmint& 1,8); 

vmc+= dist2(oldref+(bsel?width:0)+(iminb» 1 )+(width« 1 )*(jminb» 1 ), 
mb+width, width« 1 ,iminb& 1 jminb& 1 ,8); 

} 

if (vmc>var && vmc>=9*256) 
mbi->mb_type = MBINTRA; 
else 
{ 

vO = dist2(oldref+i+width*j,mb, width, 0,0, 16); 
if (4*v0>5*vmc && v0>=9*256) 

{ 

/* use MC */ 
var = vmc; 

mbi->mb_type = MBFORWARD; 
if (mbi->motion_type— MC_FRAME) 


{ 

mbi->MV[0][0][0] = imin - (i«l); 
mbi->MV[0][0][l] = jmin - 0«1); 

} 

else /* if (mbi->motion_type=MC_FIELD) */ 

{ 

/* these are FRAME vectors */ 
mbi->MV[0][0][0] = imint - (i«l); 
mbi->MV[0][0][l] = (jmint«l) - 0«1); 
mbi->MV[l][0][0] = iminb - (i«l); 
mbi->MV[l][0][lj = (jminb«l) - Ci << l); 
mbi->niv_field_sel[0][0] = tsel; 
mbi->mv_field_sel[l][0] = bsel; 

> 

} 

else 

{ 

/* No-MC */ 
var = vO; 

mbi->mb_type = 0; 
mbi->motion_type = MCFRAME; 
mbi->MV[0] [0] [0] = 0; 
mbi->MV[0][0][l] = 0; 

} 

} 

} 


/* for B_type pictures *1 

if (pict_type=B TYPE) 

{ 

/* forward prediction */ 
frame_estimate(oldorg,oldref,mb,ij,sxf,syf, 

&iminf,&jminf,&imintf,&jmintf,&iminbf,&jminbf, 

&dmcf,&dmcfieldf,&tself,&bself,iminsjmins); 

I* backward prediction V 
frame_estimate(neworg,newref,mb,ij,sxb,syb, 

&iminr,&jminr,&imintr,&jmintr,&iminbr,&jminbr, 

&dmcr,&dmcfieldr,&tselr,&bselr,iminsjmins); 

/* calculate interpolated distance */ 

I* frame */ 

dmci = bdist 1 (oldref+(iminf» l)+width*(jminf» 1 ), 
newref+(iminr» 1 )+width*(jminr» 1 ), 
mb,width,immf& 1 jminf& 1 ,iminr&l jminr& 1,16); 

/* top field 

dmcfieldi = bdist 1 (oldref+(imintf» 1 )+(tself?width:0)+(width« 1 )*(jmintf» 1 ), 
newref+(imintr»l)+(tselr?width:0)+(width«l)*(jinintr»l), 
mb,width« 1 ,imintf& 1 jmintf& 1 ,imintr& 1 jmintr& 1 ,8); 


/* bottom field 


dmcfieldi+= bdist 1 (oldref+(iminbf» 1 )+(bself?width:0)+(width« 1 )*(jminbf» 1 ), 
newref+(iminbr» 1 )+(bselr?width:0)+(width« 1 )*(jminbr» 1 ), 
mb+width,width<<l ,iminbf& 1 jminbf& 1 ,iminbr& 1 jminbr& 1 ,8); 

/* select prediction type of minimum distance from the 
* six candidates (field/frame * forward/backward/interpolated) 

*/ 

if (dmci<dmcfieldi && dmci<dmcf && dmci<dmcfieldf 
&& dmci<dmcr && dmci<dmcfieldr) 

{ 

I* frame, interpolated */ 

mbi->mb_type = MB_FORWARD|MB_BACKWARD; 
mbi->motion_type = MC_FRAME; 
vmc = bdist2(oldref+(iminf»l)+width*(jminf»l), 
newref+(iminr» 1 )+width*(jminr» 1 ), 
mb,width,iminf& 1 jminf& 1 ,iminr& 1 jminr& 1 , 1 6); 

} 

else if (dmcfieldi<dmcf && dmcfieldi<dmcfieldf 
&& dmcfieldi<dmcr && dmcfieldi<dmcfieldr) 

{ 

/* field, interpolated */ 

mbi->mb_type = MB_FORWARD|MB_BACKWARD; 
mbi->motion_type = MC_FIELD; 

vmc = bdist2(oldref+(imintf»l)+(tself?width:0)+(width«l)*(jmintf>>l), 
newref+(imintr» 1 )+(tselr?width:0)+(width« l)*(jmintr» 1 ), 
mb, width« 1 ,imintf& 1 jmintf& 1 ,imintr& 1 Jmintr& 1,8); 
vmc+= bdist2(oldref+(iminbf» 1 )+(bself?width:0)+(width« 1 )*(jminbf» 1 ), 
newref+(iminbr» 1 )+(bselr?width:0)+(width« 1 )*(jminbr» 1 ), 
mb+width,width« 1 ,iminbf& 1 jminbf& 1 ,iminbr& 1 jminbr& 1 ,8); 

} 

else if (dmcf<dmcfieldf && dmcf<dmcr && dmcf<dmcfieldr) 

{ 

/* frame, forward */ 
mbi->mb_type = MBFORWARD; 
mbi->motion_type = MC_FRAME; 
vmc = dist2(oldref+(iminf» 1 )+width*(jminf»l ),mb, 
width, imi nf& 1 Jminf& 1 , 16); 

} 

else if (dmcfieldf<dmcr && dmcfieldf<dmcfieldr) 

{ 

/* field, forward */ 
mbi->mb_type = MBFORWARD; 
mbi->motion_type = MC_FIELD; 

vmc = dist2(oldref+(tself?width:0)+(imintf»l)+(width«l)*(jmintf»l), 
mb,width« 1 ,imintf& 1 jmintf& 1 ,8); 

vmc+= dist2(oldref+(bself?width.0)+(iminbf»l)+(width«l)*(jminbf»l), 
mb+width,width« 1 ,iminbf& 1 jminbf& 1 ,8); 

} 

else if (dmcr<dmcfieldr) 

{ 

/* frame, backward */ 
mbi->mb_type = MB_BACKWARD; 
mbi->motion_type = MC_FRAME; 


vmc = dist2(ne\vref+(iminr»l)+width*(jminr»l),mb, 
width,iminr& 1 jminr&l , 1 6); 

} 

else 

{ 

/* field, backward */ 
mbi->mb_type = MB_BACKWARD; 
mbi->motion_type = MCFIELD; 

vmc = dist2(newref+(tselr?width:0)+(imintr>> 1 )+(width« 1 )*(jmintr» 1 ), 
mb,width« 1 ,immtr& 1 jmintr& 1 ,8); 

vmc+= dist2(newref+(bselr?width:0)+(iminbr» 1 )+(width« l)*(jminbr» 1 ), 
mb+ width, width« 1 ,iminbr& 1 jminbr& 1,8); 


if (vmc>var && vmc>=9*256) 
mbi->mb_type = MBINTRA; 
else 
{ 

var = vmc; 

if (mbi->motion_t>'pe=MC_FRAME) 

{ 

/* forward */ 

mbi->MV[0][0][0] = iminf - (i«l); 
mbi->MV[0][0][l] =jminf - (j«l); 

/* backward *1 

mbi->MV[0][l][0] = iminr - (i«l); 
mbi->MV[0][l][l] = jminr - (j«l); 

} 

else 

{ 

/* these are FRAME vectors */ 

/* forward *1 

mbi->MV[0][0][0] = imintf - (i«l); 
mbi->MV[0][0][l] = (jmintf«l) - (j< < l); 
mbi->MV[l][0][0] = iminbf - (i«l); 
mbi->MV[l][0][l] = (jminbf«l) - 0«1); 
mbi->mv_field_sel[0][0] = tself; 
mbi->mv_field_sel[l][0] =bself; 

/* backward */ 

mbi->MV[0][l][0] = imintr - (i«l); 
mbi->MV[0][l][l] = (jmintr«l) - (j <<: l); 
mbi->MV[l][l][0] = iminbr - (i«l); 
mbi->MV[l][l][l] = (jminbr«l) - (j«l); 
mbi->mv_field_sel[0][l] = tselr; 
mbi->mv_field_sel[l][l] = bselr; 

} 

} 

} 

mbi->var = var; 


mbi++; 



if (Iquiet) 

{ 

putc(7,stderr); 

filush(stderr); 

} 

} 

if (Iquiet) putc('\n',stdeir); 

} 


static void frame_estimate(char *org, char *ref, 

char *mb. , . 

int i, int j, \ 

int sx, int sy, 
int *iminp, int *jmmp, 
int *imintp, int *jmintp, 
int *iminbp, int *jminbp, 
float *dframep, float *dfieldp, 
int *tselp, int *bselp, 
int imins[2][2], int jmins[2][2] ) 

{ 

float dt,db,dmint,dminb; 
int imint,iminbjmintjminb; 

I* frame prediction */ 

•dframep ° fullsearch(org,ref,mb,width,ij,sx,sy,16,width,height, 
iminpjminp); 

/* predict top field from top field */ 

dt = fullsearch(org,ref > mb,width«l,ij»l,sx ) sy»l,8,width,height»l, 

&imint,&jmint); 

/* predict top field from bottom field */ 

db = fullsearch(org+width,ref+width,mb,width« 1 ,i j» l,sx,sy» 1 ,8,width,height» 1 , 

&iminb,&jminb); 

imins[0][0] = imint; 
jmins[0)[0] = jmint; 
imins[l][0] = iminb; 
jmins[l][0] = jminb; 

f* select prediction for top field */ 
if (dt<=db) 

{ 

dmint=dt; *imintp=imint; *jmintp=jmint; *tselp=0; 

} 

else 

{ 

dmint=db; *imintp=iminb; *jmintp=jmmb; *tselp=l; 

> 


/* predict bottom field from top field */ 


dt = fullsearch(org,ref,mb+width,width« 1 ,i j» 1 ,sx,sy» 1 ,8,width,height» 1 , 

/* predict bottom field from bottom field */ 
db = fullsearch(org+width, ref+width, mb+width, 

width«l, i, j»l, sx, sy»l, 8, width, height»l, 

&iminb,&jminb); 

imins[0][l] = imint; 
jmins[0][l] = jmint; 
iminsf 1 ][ 1] = iminb; 
jmins[l][l] = jminb; 

/* select prediction for bottom field */ 

if(db<=dt) 

{ 

dminb=db; *iminbp=iminb; *jminbp=jminb; *bselp=l; 

> 

else 

{ 

dminb=dt; *iminbp=imint; *jminbp=jmint; *bselp=0; 

} 

*dfieldp=dmint+dminb; 

} 


static float fullsearch(char *org, char *ref, 
char *blk, 
int lx, 
int iO, 
int jO, 
int sx, 
int sy, 
char h, 
int xmax, 
int ymax, 
int *iminp, 
int *jminp) 

{ 

int iJ,iminjmin,ilow,ihighjlowjhigh; 
float d,dmin; 
int k,l,sxy; 
int ii, jj; 


ilow = iO - sx; 
ihigh = iO + sx; 


if (ilow<0) ilow = 0; 


if (ihigh>xmax-16) ihigh = xmax-16; 

jlow = jO - sy; 
jhigh =j0 + sy; 



if (jlow<0) jlow = 0; 

if (jhigh>ymax-h) jhigh = ymax-h, 

/* full pel search, spiraling outwards */ 

imin = iO; 
jmin =j0; 

dmin = distl(org+imin+lx*jmin,blk, lx, 0,0, h, 65536.0); 

sxy = (sx>sy) ? sx : sy; 

for (1=1; l<=sxy; 1++) 

{ 

i = iO - 1; 

j = jo - 1; 

for (k=0; k<8*l; k++) 

{ 

if (i>=ilow && i<=ihigh && j>=jlow && j<=jhigh) 

{ 

d = distl(org+i+lx*j,blk,lx,0,0,h,dmin); 

if (d<dmin) 

{ 

dmin = d; 
imin = i; 
jmin = j; 

} 

} 

if (k<2*l) i++; 
else if (k<4*l) j++; 
else if (k<6*l) i~; 
else j-; 

} 

} 

/* half pel */ 

dmin = 65536.0; 

imin = imin « 1; 

jmin = jmin « 1; 

ilow = imin - (imin>0); 

ihigh = imin + (imin<((xmax-16)«l)); 

jlow = jmin - (jmin>0); 

jhigh = jmin + Onun<((ymax-h)«l)); 

for (j=jlow; j<=jhigh; j++) 
for (i=ilow; i<=ihigh; i++) 

{ 

ii = i » 1 ; 
ij = j » i ; 

d = distl(ref+ii+lx*jj,blk,lx,i&l j&l,h,dmin), 


if (d<dmin) 

{ 

dmin = d; 
imin = i; 
jmin = j; 

} 

} 

*iminp = imin; 
*jminp = jmin; 

return dmin; 

} 


static float distl(char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
charh, 
float distlim ) 

{ 

char *pl,*pla,*p2; 
char i, j; 
int v; 
float s; 

s = 0; 

pi = blkl; 
p2 = blk2; 

if (!hx && !hy) 
for (j=0;j<h;j++) 

{ 

if ((v = pi [0] - p2[0])<0) v = -v; s+= (float) v; 
if((v = pl[l] -p2[l])<0) v = -v; s+= (float) v; 
if ((v = pi [2] - p2[2])<0) v = -v; s+= (float) v; 

if ((v = pi [3] -p2[3])<0) v = -v; s+= (float) v, 

if((v = pl[4] - p2[4])<0) v = -v; s+= (float) v; 

if((v = pl[5] -p2[5])<0) v = -v; s+= (float) v; 
if ((v = pi [6] - p2[6])<0) v = -v; s+= (float) v; 
if ((v = pi [7] - p2[7])<0) v = -v; s+= (float) v; 
if((v = pl[8] -p2[8])<0) v = -v; s+= (float) v; 

if((v = pl[9] - p2[9])<0) v = -v; s+= (float) v; 

if ((v = pi [ 10] - p2[10])<0) v = -v; s+= (float) v 
if ((v = pi [1 1] - p2[ll])<0) v = -v; s+= (float) v 
if ((v = pi [12] - p2[12])<0) v = -v; s+= (float) v 
if ((v = pi [13] - p2[13])<0) v = -v; s+= (float) v 
if ((v = pi [14] - p2[14])<0) v = -v; s+= (float) v 
if ((v = pi [15] - p2[15])<0) v = -v, s+= (float) v 

if (s >= distlim) 
break; 


pl+= lx; 
p2+= lx; 

} 

else if (hx && !hy) 
for (j=0;j<h;j++) 

{ 

for (i=0, i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+l) / 2 - p2[i]); 
if (v>=0) s+= (float) v; 
else s-= (float) v; 

} 

pl+= lx; 
p2+= lx; 

) 

else if (!hx && hy) 

{ 

pla = pi + lx; 
for (j=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int)((pl[i]+pla[i]+l)/2-p2[i]); 
if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 

} 

else /* if (hx && hy) */ 

{ 

pla = pi + lx; 
for (j=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl(i]+pl[i+l]+pla[i]+pla[i+l]+2) / 4 - p2[i]); 
if (v>=0) 
s+= (float) v; 
else 

s— (float) v; 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 


return s; 


} 


static float dist2(char ♦blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h ) 

{ 

char *pl,*pla,*p2; 
char i, j; 
int v; 
float s; 

s = 0; 

pi - blkl; 
p2 = blk2; 
if (!hx && !hy) 
for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = pl[i] -p2[i]; 
s+= (float) (v*v); 

} 

pl+= lx; 
p2+= lx; 

} 

else if (hx && !hy) 
for 0=0; j<h; j++) 

{ 

for (i=0, i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+lJ+l)/2- P 2[i]); 
s+= (float) (v*v), 

} 

pl+= lx, 
p2+= lx; 

} 

else if (!hx && hy) 

{ 

pla = pi + lx; 
for (j=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pla[i]+l) / 2 - p2[i]), 
s+= (float) (v*v); 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 


else /* if (hx && hy) V 

{ 

pla = pi + lx; 
for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+pla[i]+pla[i+l]+2) / 4 - p2[i]); 
s+= (float) (v*v); 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 


return s; 

} 

static float bdistl(char *pf, char *pb, char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

char *pfa,*pfb,*pfc,*pba,*pbb,*pbc; 
char i, j; 
int v; 
float s; 

pfa = pf + hxf; 
pfb = pf + lx*hyf; 
pfc = pfb + hxf; 

pba = pb + hxb; 
pbb = pb + lx*hyb; 
pbc = pbb + hxb; 

s = 0; 

for 0=0; j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) (((*pf++ + *pfa++ + *pfb++ + *pfc++ + 2) / 4 + 

(*pb++ + *pba++ + *pbb++ + *pbc++ + 2) / 4 + 1) / 2 
- *p2++); 

if (v>= 0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

p2+= lx-16; 
pf+= lx-16; 


pfa+= lx-16; 
pfb+= lx-16; 
pfc+= lx-16; 
pb+= lx-16; 
pba+= lx-16; 
pbb+= lx-16; 
pbc+= lx-16; 

} 

return s; 

} 

static float bdist2(char *pf, char *pb, char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

char *pfa, *pfb, *pfc, *pba, *pbb, *pbc; 
char i, j; 
intv; 
float s; 

pfa = pf + hxf; 
pfb = pf + lx*hyf; 
pfc = pfb + hxf; 

pba = pb + hxb; 
pbb = pb + lx*hyb; 
pbc = pbb + hxb; 

s = 0; 

for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) (((*pf++ + *pfa++ + *pfb++ + *pfc++ + 2) / 4 + 

(*pb++ + *pba++ + *pbb++ + *pbc++ + 2) / 4 + 1) / 2 

- *p2++); 

s+= (float) (v*v); 

} 

p2+= lx-16; 
pf+= lx-16; 
pfa+= lx-16; 
pfb+= lx-16; 
pfc+= lx-16; 
pb+= lx-16; 
pba+= lx-16; 
pbb+= lx-16; 
pbc+= lx-16; 

} 

return s, 

} 


static float variance(char *p, int lx ) 

{ 

char i, j; 
int v; 

float s, s2; 

s = s2 = 0; 

for (j= 0 ;j< 16 ;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = *p++; 
s+= (float) v; 
s2+= (float) (v*v); 

} 

p+= lx- 16; 

} 

s=0.0625*s; 
return s2 - s*s, 

} 


/ft*******************************/ 
/* */ 

/* mO l.c reside at PEI *1 

/* V 

/********************************/ 


include "c:\tc2v2\dos.h" 

#include "c:\tc2v2\float.h" 

#include "c:\tc2v2\math.h" 

#include "c:\tc2v2\ctype.h" 

^include "c:\tc2v2\stdio.h" 

#include "c:\tc2v2\stdlib.h" 

#include "c:\tc2v2\string.h" 

#include "c:\tc2v2\chan.h” 
include "c:\tc2v2\thread.h" 

#include "c:\tc2v2\time.h" 

#include "c:\tc2v2\ermo.h" 

#defme GLOBAL /* used by global.h */ 

^include "c\nick\h\constl.h" 

#include "c:\nick\h\config.h" 

#include "c:\nick\h\global.h" 

static void init _ANSI_ARGS_((void)); 

char *neworg[l]; 

void main(argc,argv,envp, ins, in_ports, outs, out_ports) 
int argc,ins,outs; 
char *argvfl,*envp[]; 

CHAN *in_ports[],*out_ports[]; 

{ 

/* read parameter file */ 
chan_in_word(&M,LinkOInput); 
chan_out_word(M,Link 1 Output); 
chan_out_word(M,Link20utput); 

chan_in_word(&nframes,LinkOInput); 
chan_out_word(nframes,Link 1 Output); 
chan_out_\vord(nframes,Link20utput); 

init(); 

putseqO; 


} 

static void init() 

{ 

int i, size, a; 

chan_in_word(&mb_width,LinkOInput); 
chan_out_word(mb_width,Lmk 1 Output); 
chan_out_word(mb_width,Link20utput); 




chan_in_word(&mb_height,LinkOInput); 
chan_out_word(mb_height,Link 1 Output); 
chan_out_word(mb_height,Link20utput); 

chan_in_word(&mb_height2,LinkOInput); 
chan_out_word(mb_height2,Link 1 Output); 
chan_out_word(mb_height2,Link20utput); 

chan_in_word(&width,LinkOInput); 
chan_out_word( width, Link 1 Output); 
chan_out_word(width,Link20utput); 

chan_in_word(&height,LinkOInput); 
chan_out_word(height,Link lOutput); 
chan_out_word(height,Link20utput), 

chan_in_word(&width2,LinkOInput); 
chan_out_word(width2, Link lOutput); 
chan_out_word(width2,Link20utput); 

chan_in_word(&height2,LinkOInput); 
chan_out_word(height2,Link 1 Output); 
chan_out_word(height2,Link20utput); 

size = width*height ; 

newreffirame[0] = (char *)malloc(size); 
oldre£Trame[0] = (char *)malloc(size); 
neworgframe[0] = (char *)malloc(size); 
oldorgframe[0] = (char *)malloc(size); 
neworg[0] = (char *)malloc(size); 

mbinfo = (struct mbinfo *)malloc(l 10*sizeof( struct mbinfo)); 

for (a=0; a<110; a++) 

{ 

mbinfo [a], mbtype = mbinfo [a], moti ontype = 0; 

mbinfo[a] . MV[0] [0] [0] = mbinfo[a].MV[0][0][l] = mbinfo[a].MV[0][l][0] = 0; 
mbinfo[a].MV[0][l][l] = mbinfo[a].MV[l][0][0] = mbinfo[a].MV[l][0][l] = 0; 
mbinfo[a].MV[l][l][0] = mbinfo[a].MV[l][l][l] = 0; 
mbinfo[a].mv_field_sel[0][0] = mbinfo[a].mv_field_sel[0][l] = 0; 
mbinfo[a].mv_field_sel[l][0] = mbinfo[a].mv_field_sel[l][l] = 0; 

} 
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void putseqO 

{ 

I* this routine assumes (N % M) = 0 */ 
int i, j, k, f, fO, n, np, nb; 
int a; 




int sxf, syf, sxb, syb; 
int size, loop; 

size = width*height; 


/* loop through all frames m encoding/decoding order *1 
do{ 

chan_in_word(&loop,LinkOInput); 
chan_out_word(Ioop, Link 1 Output); 
chan_out_word(loop,Link20utput); 

if(loop=nframes) break; 

chan_in_word(&pict_type,LinkOInput); 

chan_out_word(pict_type,LinklOutput), 

chan_out_word(pict_type,Link20utput); 

chan_in_word(&sxf,LinkOInput); 
chan_out_word(sxf,Link 1 Output), 
chan_out_word(sxf,Link20utput); 

chan_in_word(&syf,LinkOInput); 
chan_out_word(syf,Link 1 Output); 
chan_out_word(syf,Link20utput); 

chanin _word(&sxb,LinkOInput); 
chan_out_word(sxb,Link 1 Output); 
chan_out_word(sxb,Link20utput); 

chan_in_word(&syb,LinkOInput); 
chan_out_word(syb,Link 1 Output); 
chan_out_word(syb,Link20utput); 

if ( pict type != I TYPE ) 

{ 

chan_in_message(size,neworg[0],LinkOInput); 

chan_out_message(size,neworg[0],LinklOutput); 

chan_out_message(size,neworg[0],Link2Output); 

chan_in_message(size,oldorgframe[0],LinkOInput); 

chan_out_message(size,oldorgframe[0],LinklOutput); 

chan_out_message(size,oldorgframe[0],Link20utput); 

chan_in_message(size,oldrefframe[0],LinkOInput); 

clian_out_message(size,oldrefffame[0],LinklOutput); 

chan_out_message(size,oldre£frame[0],Link2Output); 

if ( pict_type != P TYPE ) 

{ 

chan_in_message(size,neworgframe[0],LinkOInput); 
chan_out_message(size,neworgframe[0],Link 1 Output); 
chan_out_message(size,neworgffame[0],Link2Output), 


chanJn_message(size,newre£frame[0],LinkOInput); 

chan_out_message(size,newrefframe[0],LinklOutput), 

chan_out_message(size,newre£frame[0],Link2Output); 

} 

} 

/* loop through 36 macroblocks of the picture */ 

motion_estimation(oldorgframe[0], neworgframe[0], 
oldrefframe[0], newrefframe[0], 
neworg[0], 
sxf, syf, 
sxb, syb, 
mbinfo); 

for (a=36; a<73; a++) 

{ 

chan_in_word(&mbinfo[a] . mb_type, Link 1 Input); 
chan_in_word(&mbinfo[a].motion_type,LinklInput); 
chan _in_\vord(&mbinfo[a] .MV[0] [0] [0],Link 1 Input); 
chan Jn_word(&mbinfo[a].MV[0](0] [l],Linkl Input); 
chan_in_word(&mbinfo[a] .MV[0] [ 1 ] [0],Link 1 Input); 
chan_in_word(&mbinfola] .MV[0] [ 1] [ 1 ] ,Link 1 Input); 
chan Jn_\vord(&mbinfo[a] ,MV[ 1 ] [0] [0] .Link 1 Input); 
chan_in_word(&mbinfo[a] ,MV[ 1 ] [0] [ 1 ] .Link 1 Input); 
chan_in_word(&mbinfo[a] .MV[ 1 ] [ 1 ] [0],Link 1 Input); 
chan_in_word(&mbinfola) . MV[ 1 ] [ 1 ] [ 1 ] .Link 1 Input) ; 
chan_in_word(&mbinfo[a] . mv_field_sel[0] [0] .Link 1 Input); 
chan _in_word(&mbinfo[a] ,mv_field_sel [0] [ 1 ] .Link 1 Input); 
chan _in_word(&mbinfo[a] ,mv_field_sel[ 1 ] [OJ.Link 1 Input); 
chan_in_word(&mbinfo[a] . mv_field_sel [ 1 ] [ 1 ] .Link 1 Input): 
} 

for (a=73; a<l 10; a++) 

{ 

chan_in_word(&mbinfo[a] . mb_type,Link2Input); 
chan_in_word(&mbinfo[a].motion_type,Link2Input); 
chan_in_word(&mbinfo[a].MV[0][0][0],Link2Input); 
chan _in_word(&mbinfo[a] .MV[0] [0] [ 1 ],Link2Input); 
chan_in_word(&mbinfo[a] .MV[0] [ 1 ] [0],Link2Input); 
chan_in_word(&mbinfo[a] . MV[0] [ 1 ] [ 1 ] ,Link2 Input); 
chan_in_word(&mbinfo[a].MV[l][0][0],Link2Input); 
chan_in_word(&mbinfo[a] ,MV[ 1 ] [0] [ 1 ],Link2Input); 
chan_in_\vord(&mbinfo[a].MV[l][l][0],Link2Input); 
chan_in_word(&mbinfo[a] ,MV[ 1 ] [ 1 ] [ 1 ],Link2Input); 
chan_in_word(&mbinfo[a].mv_field_sel[0]l0],Link2Input) 
chan_in_\vord(&mbinfo{a].mv_field_sel[0][l],Link2Input) 
chan_in_word(&mbinfo[a] mv_field_sel[ 1 ] [0],Link2Input) 
chan_in_word(&mbinfo[a] . mv _field_sel [ 1 ] [ 1 ] ,Link2Input) 
} 


for (a=0; a<l 10; a++) 

{ 


chan_out_word(mbinfo[a].mb_type,LinkOOutput); 
chan_out_word(mbinfo[a] motiontype.LinkOOutput); 
chanout _word(mbinfo[a] .M V[0] [0] [0] ,Li nkOOutput) ; 
chan_out_word(mbinfo[a] MV[0][0][ l],LinkOOutput); 
chan_out_word(mbinfo[a].MV[0][l][0],Link0Output); 
chan_out_word(mbinfo[a].MV[0] [ 1 ] [ 1 ],LinkOOutput); 
chan_out_word(mbinfo[a] MV[l][0][0],Link0Chitput); 
chan_out_word(mbinfo[a].MV[ 1 ] [0] [ 1 ],LinkOOutput); 
chan_out_word(mbinfo[a] .MV[ 1 ] [ 1 ] [0],LinkOOutput); 
chan_out_word(mbinfo[a] M V[ I ][ I J [ l],LinkOOutput), 
chan_out_word(mbinfo[a] mv_field_sel[0][0],Link0Output); 
chanout _word(mbinfo[a] . mv_field_sel [0] [ 1 ] .LinkOOutput): 
chan_out_word(mbinfo[a].mv_field_sel[l)[0],LinkOOutput); 
chan_out_word(mbinfola].mv_field_sel{l]l\],LinkOOulput): 
} 

}while(loop<nfirames), 

chan_in_word(&loop,Link 1 Input); 
chan_in_word(&Ioop,Link2Input); 
chan_out_word(loop,LinkOOutput); 

}/*end of main()*/ 


static void frame estimate _ANSI_ARGS_((char *org, 
char *ref, 
char *mb, 
int i, int j, 
int sx, int sy, 
int *iminp, int *jminp, 
int *imintp, int *jmintp, 
int *iminbp, int *jminbp, 
float *dframep, float *dficldp, 
int *tselp, int *bselp, 
int imins[2][2], int jmins[2] [2])); 

static float fullsearch _ANSI_ARGS_((char *org, 
char *ref, 
char *blk, 
int lx, 
int iO, 
int jO, 
int sx, 
int sy, 
char h, 
int xmax, 
int ymax, 
int *iminp, 
int *jnunp)); 

static float distl _ANSI_ARGS_((char *blkl, 
char *blk2, 
int lx. 


char hx, 
char hy, 
char h, 

float distlim)); 

static float dist2 _ANSI_ARGS_((char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h)); 

static float bdistl _ANSI_ARGS_((char *pf, 
char *pb, 
char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
int hyb, 
char h)); 

static float bdist2 _ANSl_ARGS_((char *pf, 
char *pb, 
char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
int hyb, 
char h)); 

static float variance _ANSI_ARGS_((char *p, int lx)); 


void motion_estimation(char *oldorg, char ♦neworg, 
char *oldref, char *newref, 
char *cur, 
int sxf, 
int syf, 
int sxb, 
int syb, 

struct mbinfo *mbi) 

{ 

int i, j; 

int iminjmin,iminfjminf,iminr jminr; 
int imintjinint.iminbjminb; 
int imintf jmintf.iminbfjminbf, 
int imintr jmintr.iminbr jminbr; 
float var.vO; 

float dmc,dmcf,dmcr,dmci,vmc,vmcf,vmcr,vmci; 
float dmcfield,dmcfleldf,dmcfleldr,dincfieldi; 
int tsel,bsel,tself,bself,tselr,bselr; 
char *mb; 


int imins[2][2] jmins[2][2]; 
int a, ii,jj; 

/* loop through 36 macroblocks of the picture */ 

for(a=36; a<72; a++) 

{ 

ij = a»4; 
ii = a - (ij«4); 
j =ij«4; 
i = ii«4; 


mb = cur + i + width*j; 


var = variance(mb, width); 

/* for I_type pictures */ 

if (pict_type=I_TYPE) 
mbi->mb_type = MBfNTRA; 

/* for P_type pictures */ 

if (pict_type=P_TYPE) 

{ 

frame_estimate(oldorg,oldref,mb,ij,sxf,syf, 

&imin > &jmin > &imint,&jmint > &iminb,&jminb, 

&dmc,&dmcfield,&tsel,&bsel,iminsjmins); 

/* select between frame and field prediction */ 
if (dmc<=dmcfield) 

{ 

mbi->motion_type = MC_FRAME; 
vmc = dist2(oldref+(imin»l)+width*(jmin»l),mb, 
width,imin& 1 jmin& 1 , 16); 

} 

else 

{ 

mbi->motion_type = MC_FIELD; 
dmc = dmcfield; 

vmc = disl2(oldref+(tsel?width:0)+(imint»l)+(width«l)*(jmint»l), 
mb,width« 1 ,imint& 1 Jmint& 1 ,8); 

vmc+=dist2(oldref+(bsel?width:0)+(iminb»l)+(width«l)*(jminb»l), 
mb+width,width« 1 ,iminb& 1 jminb& 1 ,8); 


if (vmc>var && vmc>=9*256) 
mbi->mb_type = MBINTRA; 
else 

vO = dist2(oldref+i+width*j,mb,width,0,0,16); 
if (4*v0>5*vmc && vO>=9*256) 

{ 


/* use MC */ 
var = vmc; 

mbi->mb_type = MBFORWARD; 
if (mbi->motion_type=MC_FRAME) 

{ 

mbi->MV[0][0][0] = imin - (i«l); 
mbi->MV[0][0][I] = jmin - (j <<1 ); 

} 

else /* if (mbi->motion_type=MC_FIELD) */ 

{ 

I* these are FRAME vectors */ 
mbi->MV[0][0][0] = imint - (i«l); 
mbi->MV[0][0][l] = (jmint«l) - (j <<: l); 
mbi->MV[l][0][0] = iminb - (i«l); 
mbi->MV[l][0][l] = (jminb«l) - (j«l); 
mbi->mv_field_sel[0][0] = tsel; 
mbi->mv_field_sel[l][0] = bsel; 

} 

} 

else 

{ 

/* No-MC */ 
var = vO; 

mbi->mb_type = 0; 
mbi->motion_type = MC_FRAME; 
mbi->MV[0][0][0] = 0; 
mbi->MV[0]10][l] = 0; 

} 

} 

} 


/* for B_type pictures */ 

if (pict_type=B_TYPE) 

{ 

/* forward prediction */ 
frame_estimate(oldorg,oldref,mb,ij,sxf,syf, 

&iminf,&jminf,&imintf,&jmintf,&iminbf,&jminbf, 

&dmcf > &dmcfieldf,&tself,&bself,iminsjmins); 

/* backward prediction */ 
frame_estimate(neworg,newref,mb,ij,sxb > syb, 

&iminr,&jminr > &imintr,&jmintr,&iminbr,&jminbr, 

&dmcr,&dmcfieldr,&tselr,&bselr,iminsjmins); 

/* calculate interpolated distance */ 

/* frame */ 

dmci = bdistl(oldref+(iminf»l)+width*(jniinf»l), 
newref+(iminr» 1 )+width*(jminr > > 1 ), 
mb,width > iminf& 1 jminf& 1 ,iminr& 1 jminr& 1 , 16); 

/* top field 

dmcfieldi = bdistl(oldref+(imintf»l)+(tself?width:0)+(width«i)*(jmintf»l), 


newref+(imintr» 1 )+(tselr?width:0)+(width« 1 )*(jmintr» 1), 
mb,width« 1 ,imintf& 1 jmintf& 1 ,imintr& 1 jmintr& 1,8); 

/* bottom field 

dmcfieldi+= bdist 1 (oldref+(iminbf» 1 )+(bself?width'0)+(width« 1 )*(jminbf» 1 ), 
newref+(iminbr» 1 )+(bseIr?width:0)+(width« 1 )*(jminbr» 1 ), 
mb+width,width«l ,iminbf& 1 jminbf& 1 ,immbr& 1 jminbr& 1,8), 

if (dmci<dmcfieldi && dmci<dmcf && dmci<dmcfieldf 
&& dmci<dmcr && dmci<dmcfieldr) 

{ 

/* frame, interpolated */ 

mbi->mb_type = MB_FORWARD|MB_BACKWARD; 
mbi->motion_type = MC_FRAME; 
vmc = bdist2(oldref+(iminf»l)+width*(jminf»l), 
ne\vref+(iniinr»l)+\vidth*(jminr»l), 
mb,width,iminf& 1 jminf& 1 ,iminr& 1 jminr& 1 , 16); 

} 

else if (dmcfieldi<dmcf && dmcfieldi<dmcfieldf 
&& dmcfieldi<dmcr && dmcfieldi<dmcfieldr) 

{ 

/* field, interpolated *1 

mbi->mb_type = MB_FORWARD|MB_BACKWARD; 
mbi->motion_type = MC_FIELD; 

vmc = bdist2(oldref+(imintl>>l)+(tself?width:0)+(width«l)*(jmintf^ > >l). 
newref+(imintr» 1 )+(tselr?width:0)+(width«l )*(jmintr» 1 ), 
mb,width«l ,imintf& 1 jmintf&l ,imintr& 1 jmintr& 1 ,8), 
vmc+= bdist2(oldref+(iminbf»l)+(bself7width:0)+(width«l)*(jminbf» 1 ), 
newref+(iminbr» 1 )+(bselr?width:0)+(width«l)*(jminbr» 1 ), 
mb+width,width«l ,iminbf& 1 jminbf& 1 ,iminbr& 1 jminbr& 1 ,8); 

} 

else if (dmcf<dmcfieldf && dmcf<dmcr && dmcf<dmcfieldr) 

{ 

/* frame, forward */ 
mbi->mb_type = MBFORWARD; 
mbi->motion_type = MC_FRAME; 
vmc = dist2(oIdref+(iminf»l)+width*(jminf»l),mb, 
width,iminf& 1 jminf& 1,16); 

} 

else if (dmcfieldf<dmcr && dmcfieldf<dmcfieldr) 

{ 

/* field, forward */ 
mbi->mb_type = MBFORWARD; 
mbi->motion_type = MC_FIELD; 

vmc = dist2(oldref+(tself?width:0)+(imintf»l)+(width«l)*(jmintfi»l), 
mb,width« 1 ,imintf& 1 jmintf& 1 ,8); 

vmc+= dist2(oldref+(bself?width:0)+(iminbf» 1 )+(width« l)*(jminbf^ > > 1 ), 
mb+width,width«l ,iminbf& 1 jminbf& 1 ,8); 

} 

else if (dmcr<dmcfieldr) 

{ 

/* frame, backward */ 
mbi->mb_type = MBBACKWARD; 
mbi->motion_type = MC_FRAME; 


vmc = dist2(newref+(iminr»l)+width*(jminr»l),mb, 
width,iminr&ljminr&l,16); 

} 

else 

{ 

/* field, backward */ 
mbi->mb_type = MB_BACKWARD; 
mbi->motion_type = MCJTELD; 

vmc = dist2(newref+(tselr?width:0)+(imintr»l)+(width«l)*(jniintr»l), 
mb,width« 1 ,imintr& 1 jmintr& 1 ,8); 

vmc+= dist2(newref+(bselr?\vidth:0)+(iminbr»l)+(width«l)*(jminbr»l), 
mb+width,width« 1 ,iminbr&l jminbr& 1 ,8); 


if (vmc>var && vmc>=9*256) 
mbi->mb_type = MB_INTRA; 
else 
{ 

var = vmc; 

if (mbi->motion_type=MC_FRAME) 

{ 

/* forward */ 

mbi->MV[0][0][0] = iminf - (i«l); 
mbi->MV[0][0][l] = jminf - (j«l); 

/* backward */ 

mbi->MV[0][l][0] = iminr - (i«l); 
mbi->MV[0][l][l] = jminr - (j-^l); 

} 

else 

{ 

/* these are FRAME vectors */ 

/* forward */ 

mbi->MV[0][0][0] = imintf - (i«l); 
mbi->MV[0][0][l] = Omintf«l) - (j«l); 
mbi->MV[l][0][0] = iminbf - (i«l); 
mbi->MV[l][0]Il] = Ominbf«l) - 0«1); 
mbi->mv_field_sel[0][0] = tself; 
mbi->mv_field_sel[l][0] = bself; 

/* backward */ 

mbi->MV[0][l][0] = imintr - (i«l); 
mbi->MV[0][l][l] = (jmintr«l) - (j«l); 
mbi->MV[l][l][0] = iminbr - (i«l); 
mbi->MV[l][l][l] = (jminbr«l) - 0«1); 
mbi->mv_field_sel[0)[l] = tselr; 
mbi->mv_field_sel[l][l] = bselr; 

} 

} 

} 

mbi->var = var; 


mbi++; 


static void frame_estimate(char *org, char *ref, 
char *mb, 
int i, int j, 
int sx, int sy, 
int *iminp, int *jminp, 
int *imintp, int *jmintp, 
int *iminbp, int *jminbp, 
float *dframep, float *dfieldp, 
int *tselp, int *bselp, 
int imins[2][2], int jmins[2][2] ) 

{ 

float dt,db,dmint,dminb; 
int imint,iminbjmint jminb; 

/* frame prediction */ 

♦dframep = fiillsearch(org,ref,mb,width,ij,sx,sy,16,width, height, 
iminpjminp); 

/* predict top field from top field */ 

dt = ful!search(org,ref,mb,width<<l,ij>>l,sx,sy>>l,8,width,height>>I, 
&imint,&jmint); 

/* predict top field from bottom field */ 

db = fullsearch(org+width,ref+width,mb,width<< 1 ,i j» 1 ,sx,sy» 1 ,8, width,height» 1 , 
&iminb,&jminb); 

imins[0][0] = imint; 
jmins[0][0] = jmint; 
imins[l][0] = iminb; 
jmins[l][0] = jminb; 

/* select prediction for top field */ 
if (dt<=db) 

{ 

dmint=dt; *imintp=imint; *jmintp=jmint; *tselp=0; 

} 

else 

{ 

dmint=db; *imintp=iminb; *jmintp=jminb; *tselp=l, 

} 

/* predict bottom field from top field */ 

dt = fullsearch(org,ref,mb+width,width«l,ij»l,sx,sy»l,8,width,height»l, 
&immt,&jmint); 

/* predict bottom field from bottom field */ 
db = fullsearch(org+width, ref+width, mb+width, 

width«l, i, j»l, sx, sy»l, 8, width, height»l, 

&iminb,&jminb); 


imins[0][l] = imint; 


jmins[0][l] = jmint; 
imins[l][l] = iminb; 
jmins[l][l] = jminb; 

/* select prediction for bottom field */ 
if (db<=dt) 

{ 

dminb=db; *iminbp=iminb; *jminbp=jminb; *bselp=l; 

} 

else 

{ 

dminb=dt; *iminbp=imint; *jminbp=jmint; *bselp=0; 

} 

*dfieldp=dmint+dminb; 

} 


static float fullsearchfchar *org, char *ref, 
char *blk, 
int lx, 
int iO, 
int jO, 
int sx, 
int sy, 
char h, 
int xmax, 
intymax, 
int *iminp, 
int *jminp) 

{ 

int ij,iminjmin,ilow,ihighjlowjhigh; 
float d,dmin; 
int k,l,sxy; 
int ii, jj; 


ilow = iO - sx; 
ihigh = iO + sx; 


if (ilow<0) ilow = 0; 


if (ihigh>xmax-16) ihigh = xmax-16; 


jlow = jO - sy; 
jhigh = jO + sy; 


if (jlow<0) jlow = 0; 


if (jhigh>ymax-h) jhigh = ymax-h; 


/* full pel search, spiraling outwards *1 

imin = iO; 
jmin =j0; 


dmin = distl(org+imin+lx*jmin,blk,lx,0Ah,65536.0); 

sxy = (sx>sy) ? sx : sy; 

for (1=1; l<=sxy; 1++) 

{ 

i = iO - 1; 

j = jO - 1; 

for (k=0; k<8*l; k++) 

{ 

if (i>=ilow && i<=ihigh && j>=jlo\v && j<=jhigh) 

{ 

d = distl(org+i+lx*j,bik,lx,0,0,h,dmin), 

if (d<dmin) 

{ 

dmin = d; 
imin = i; 
jmin =j; 

} 

} 

if (k<2*l) i++; 
else if (k<4*l) j++; 
else if (k<6*l) i~; 
else j-; 

} 

} 

/* half pel */ 

dmin = 65536.0; 

imin = imin « 1; 

jmin = jmin « 1; 

ilow = imin - (imin>0); 

ihigh = imin + (imin<((xmax-16)«l)); 

jlow = jmin - (jmin>0); 

jhigh = jmin + (jmin<((ymax-h)«l)); 


for (j=jlow; j<=jhigh; j++) 
for (i=ilow; i<=ihigh; i++) 

{ 

ii = i » 1 ; 
jj=j» 1 ; 

d = distl(reWi+lx*jj,blk4x4&l j&l,h,dmin); 

if (d<dmin) 

{ 

dmin = d; 
imin = i; 
jmin = j; 

} 

} 

*iminp = imin; 


♦jminp = jmin; 
return dmin, 

} 

static float distl(char *blkl, 
char *blk2, 
int lx, 
charhx, 
char hy, 
char h, 
float distlim ) 

{ 

char *pl,*pla,*p2; 
char i,j; 
intv; 
float s; 

s = 0; 
pi =blkl; 
p2 = blk2; 


if (!hx && !hy) 
for (j=0;j<h,j++) 

{ 

if ((v = pi [0] - p2[0])<0) v = -v; s+= (float) v; 
if((v = pl[l] -p2[l])<0) v = -v; s+= (float) v; 
if ((v = p 1 [2] -p2[2])<0) v = -v; s+= (float) v; 
if ((v = pi [3] - p2[3])<0) v = -v; s+= (float) v; 
if ((v = pi [4] - p2[4])<0) v = -v; s+= (float) v; 
if((v = pl[5] -p2[5])<0) v = -v; s+= (float) v, 
if((v = pl[6] -p2[6])<0) v = -v; s+= (float) v; 
if ((v = pi [7] - p2[7])<0) v = -v; s+= (float) v; 
if ((v = pi [8] - p2[8J)<0) v = -v; s+= (float) v; 
if ((v = pi [9] - p2[9])<0) v = -v; s+= (float) v; 
if ((v = pi [ 10] - p2[10])<0) v = -v; s+= (float) v 
if ((v = pi [1 1] -p2[ll])<0) v = -v; s+= (float) v 
if ((v = pl[12] - p2[12])<0) v = -v; s+= (float) v 
if ((v = pi [ 13] - p2[13])<0) v = -v; s+= (float) v 
if ((v = pl[14] - p2[14])<0) v = -v; s+= (float) v 
if ((v = pi [ 15] - p2[15])<0) v = -v, s+= (float) v 

if (s >= distlim) 
break; 

pl+= lx, 
p2+= lx; 

} 

else if (hx && !hy) 
for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+l) / 2 - p2[i]); 


if (v>=0) s+= (float) v, 
else s-= (float) v; 

} 

pl+= lx; 
p2+= lx; 

} 

else if (!hx && hy) 

{ 

pla = pi + lx; 
for (j=0,j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int ) ((pl[i]+pla[i]+l) / 2 - p2[i]); 
if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 

} 

else /* if (hx && hy) */ 

{ 

pla = pi + lx, 
for (j=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+pla[i]+pla[i+l]+2) / 4 -p2[i]); 
if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

pi =pla; 
pla+= lx; 
p2+= lx; 

} 


return s; 

} 

static float dist2(char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h ) 

{ 

char *pl,*pla,*p2; 


char i, j; 
intv; 
float s; 

s = 0, 
pi =blkl, 
p2 = blk2; 
if (!hx && !hy) 
for (j=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v - pl[i] - p2[i]; 
s+= (float) (v*v); 

} 

pl+= lx; 
p2+= lx; 

} 

else if (hx && !hy) 
for (j=0;j<h;j++) 

{ 

for (i=0, K16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+l) 12 - p2[i]); 
s+= (float) (v*v); 

} 

pl+= lx; 
p2+= lx; 

} 

else if (!hx && hy) 

{ 

pla = pi + lx; 
for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pla[i]+l)/2-p2[i]); 
s+= (float) (v*v); 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 

} 

else /* if (hx && hy) */ 

{ 

pla = pi + lx; 
for (j=0; j<h; j-H-) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+pla[i]+pla[i+l]+2)/4 - p2[i]); 
s+= (float) (v*v); 

} 


pi = pla; 
pla+= lx; 
p2+= lx; 

} 

} 

return s; 

} 

static float bdistl(char *pf, char *pb, char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

char *pfa,*pfb,*pfc,*pba,*pbb,*pbc; 
char i,j; 
int v; 
float s; 

pfa = pf + hxf, 
pfb = pf + lx*hyf; 
pfc = pfb + hxf; 

pba = pb + hxb; 
pbb = pb + lx*hyb; 
pbc = pbb + hxb; 

s = 0; 

for (j=0;j<h;j-H-) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) (((*pf++ + *pfa++ + *pfb++ + *pfc++ + 2) / 4 + 

(*pb++ + *pba++ + *pbb++ + *pbc++ + 2) / 4 + 1) / 2 
- *p2++); 

if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

> 

p2+= lx-16; 
pf+= lx-16; 
pfa+= lx-16; 
pfb+= lx-16; 
pfc+= lx-16; 
pb+= lx-16; 
pba+= lx-16; 
pbb+= lx-16; 
pbc+= lx-16; 

} 


return s; 


static float bdist2(char *pf, char *pb, char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

char *pfa,*pfb,*pfc,*pba,*pbb,*pbc; 
char ij; 
int v; 
float s; 

pfa = pf + hxf; 
pfb = pf + lx*hyf; 
pfc = pfb + hxf; 

pba = pb + hxb; 
pbb = pb + lx*hyb, 
pbc = pbb + hxb; 

s = 0; 

for (j=0,j<h;j-H-) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) (((*pf++ + *pfa++ + *pfb++ + *pfc++ + 2) / 4 + 

(*pb++ + *pba++ + *pbb++ + *pbc++ + 2) / 4 + 1) / 2 
- *p2++); 
s+= (float) (v*v); 

} 

p2+= lx-16; 
pf+= lx-16; 
pfa+= lx-16; 
pfb+= lx-16; 
pfc+= lx-16; 
pb+= lx-16; 
pba+= lx-16; 
pbb+= lx-16; 
pbc+= lx-16; 

> 

return s; 

} 

static float variance(char *p, int lx ) 

{ 

char i, j; 
int v; 
float s, s2; 


s = s2 = 0; 

for (j=0; j<16; j++) 


{ 

for (i=0; i<16; i++) 

{ 

v = *p++; 
s+= (float) v; 
s2+= (float) (v*v); 

} 

p+= lx- 16; 

} 

s=0.0625*s; 
return s2 - s*s; 

} 



/********************************/ 

/* */ 

/* mO 1 1 .c residing at PE2 *1 

/* *1 

/********************************y 

^include "c:\tc2v2\dos.h" 

^include "c:\tc2v2\float.h" 

#include "c:\tc2v2\math.h" 

^include "c:\tc2v2\ctype.h" 

#include "c.\tc2v2\stdio.h" 

^include "c:\tc2v2\stdlib.h" 

^include "c:\tc2v2\string.h" 

#include "c:\tc2v2\chan.h" 

^include "c:\tc2v2\thread.h" 

^include "c:\tc2v2\time.h" 

#include "c:\tc2v2\ermo.h" 

#define GLOBAL /* used by global.h */ 

#include "c:\nick\h\constl.h" 

#include "c:\nick\h\config.h" 

#include "c:\nick\h\global.h" 

static void init _ANSI_ARGS_((void)); 

char *neworg[l]; 

void main(argc,argv,envp,ins,in_ports,outs,out_ports) 
int argc,ins,outs; 
char *argv[],*envp[]; 

CHAN *in_ports[],*out_ports[]; 

{ 

/* read parameter file */ 

chan_in_word(&M,LinkOInput); 

chan_in_\vord(&nframes,LinkOinput); 

init(); 

putseqO; 


} 

static void init() 

{ 

int i, size, a; 

chan_in_word(&mb_width,LinkOInput); 

chan_in_word(&mb_height,LinkOInput); 

chan_in_word(&mb_height2,LinkOInput); 

chan_in_word(&width,LinkOInput); 

chan_in_word(&height,LinkOInput); 

chan_in_word(&width2,LinkOInput); 

chan_in_word(&height2,LinkOInput); 


size = width*height ; 


ne\vrefframe[0] = (char *)malloc(size); 
oldrefframc[0] = (char *)maIloc(size); 
neworgframe[0] = (char *)malloc(size); 
oldorgframe[0] = (char *)malloc(size); 
neworgfO] = (char *)malloc(size); 

mbinfo = (struct mbinfo *)malloc(37*sizeof(struct mbinfo)); 

for (a=0; a<37; a++) 

{ 

mbinfo[a].mb_type = mbinfo [a] . moUontype = 0; 

mbinfo[a] MV[0]10][0] = mbinfo[a].MV[0][0][l] = mbinfo[a].MV[0][l][0] = 0, 
mbinfo[a].MV[0][l][l] = mbinfo[a].MV[l][0][0] = mbinfo[a].MV[l][0][l] = 0; 
mbinfo[a].MV[l][l][0] = mbinfo[a].MV[l][l][l] = 0; 
mbinfo[a].mv_field_sel[0][0] = mbinfo[a].mv_field_sel[0][l] = 0; 
mbinfo[a].mv_field_sel[l][0] = mbinfo[a].mv_field_seI[l][l] = 0; 

} 


} 


void putseqO 

{ 

/* this routine assumes (N % M) = 0 */ 
int i, j, k, f, fD, n, np, nb; 
inta; 

int sxf, syf, sxb, syb; 
int size, loop; 

size = width*height; 


/* loop through all frames in encoding/decoding order */ 
do{ 

chan_in_word(&loop,LinkOInput); 

if(loop=nframes) break, 

chan_in_word(&pict_type,LinkOInput); 

chan_in_word(&sxf,LinkOInput); 

chan_in_\vord(&syf,LinkOInput); 

chan_in_word(&sxb,LinkOInput); 

chan_in_word(&syb,LinkOInput); 

if ( pict_type != I_TYPE ) 

{ 

chan_in_message(size,neworg[0],LinkOInput); 
chan_in_message(size,oldorgframe[0],LinkOInput); 
chan_in_message(size,oldrefframe[0],LinkOInput); 
if ( pict type != P TYPE ) 

{ 

chan_in_message(size,neworgframe[0],Link01nput); 


chan_in_message(size,newrefframe[0],Link0Input); 

} 

} 

/* loop through 37 macroblocks of the picture */ 

motion_estimation(oldorgframe[0], neworgframe[0], 
oldrefirame[0], newrefframe[0], 
neworg[0], 
sxf, syf, 
sxb, syb, 
mbinfo); 


for (a=0; a<37; a++) 

{ 

chan_out_word(mbinfo[a].mb_type,LinkOOutput); 
chan_out_word(mbinfo[a].motion_type,LinkOOutput); 
chan_out_word(mbinfo[a] .MV[0] [0] [0],LinkOOutput); 
chan_out_word(mbinfo[a] .MV[0] [0] [ 1 ],LinkOOutput); 
chan_out_word(mbinfo[a] .MV[0] [ 1] [0],LinkOOutput); 
chan_out_word(mbinfo[a] .MV[0] [ 1] [ 1 ],LinkOOutput); 
chan_out_word(mbinfo[a] .MV[ 1 ] [0] [OJ.LinkOOutput); 
chan_out_word(mbinfo[a] .MV[ 1] [0][ l],LinkOOutput); 
chan_out_word(mbinfo[a].MV[ 1 ] [ 1 ] [0],LinkOOutput); 
chan_out_word(mbinfo[a] .MV[ 1 ] [ 1 ] [ 1 ] .LinkOOutput); 
chan_out_word(mbinfo [a] . mv_field_sel [0] [0] ,LinkOOutput) 
chan_out_word(mbinfo [a] . mvfieldsel [0] [ 1 ] .LinkOOutput) 
chan_out_word(mbinfo[a].mv_field_sel[l][0], LinkOOutput) 
chan_out_word(mbinfo[a] . mvfieldsel [ 1 ] [ 1 ] .LinkOOutput) 
} 

}while(loop<nframes); 
chan_out_word(loop, LinkOOutput); 

} 

/* motion.c, motion estimation */ 

/* private prototypes */ 

static void frame estimate _ANSI_ARGS_((char *org, 
char *ref, 
char *mb, 
int i, int j, 
int sx, int sy, 
int *iminp, int *jminp, 
int *imintp, int *jmintp, 
int *iminbp, int *jminbp, 
float *dframep, float *dfieldp, 
int *tselp, int *bselp, 
int imins[2][2], int jmins[2][2])); 


static float fullsearch _ANSI_ARGS_((char *org, 
char *ref, 
char *blk, 
int lx, 
int iO, 
int jO, 
int sx, 
intsy, 
char h, 
int xmax, 
intymax, 
int *iminp, 
int *jminp)); 

static float distl _ANSI_ARGS_((char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h, 

float distlim)); 

static float dist2 _ANSI_ARGS_((char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h)); 

static float bdistl _ANSI_ARGS_((char *pf, 
char *pb, 
char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
int hyb, 
char h)), 

static float bdist2 _ANSI_ARGS_((char *pf, 
char *pb, 
char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
int hyb, 
char h)); 

static float variance _ANSI_ARGS_((char *p, int lx)); 


void motion_estimation(char *oldorg, char *neworg, 
char *oldref, char *newref, 


char *cur, 
int sxf, 
int syf, 
int sxb, 
int syb, 

struct mbinfo *mbi) 

{ 

int i, j; 

int iminjmin,iminfjminf,iminrjminr; 
int imintjmint,iminbjminb; 
int imintfjmintf.iminbfjminbf; 
int imintr jmintr,iminbr jminbr; 
float var,vO; 

float dmc,dmcf,dmcr,dmci,vmc,vmcf,vmcr,vmci; 
float dmcfield,dmcfieldf,dmcfieldr,dmcfieldi; 
int tsel,bsel,tself,bself,tselr,bselr; 
char *mb; 

int imins[2][2]jmins[2][2]; 
int a, ii,jj; 

/* loop through all macroblocks of the picture */ 

for(a=72; a<109; a++) 

{ 

jj = a»4; 
ii = a - (ij«4); 
j =j|«4; 
i = ii«4; 


mb = cur + i + width*j; 


var = variance(mb, width); 

/* for I type pictures */ 

if (pict_type=I_TYPE) 
mbi->mb_type = MBINTRA; 

/* for P_type pictures */ 

if (pict_type=P_TYPE) 

{ 

frame_estimate(oldorg,oldref,mb,ij > sxf,syf ) 

&imin,&jmin,&imint,&jmint,&iminb,&jminb, 

&dmc,&dmcfleld,&tsel,&bsel,iminsjmins), 

/* select between frame and field prediction */ 
if (dmc<=dmcfield) 

{ 

mbi->motion_type = MC_FRAME; 
vmc = dist2(oldref+(imin»l)+width*(jmin»l),mb, 
width, imin& 1 jmin& 1 , 16); 

} 

else 


{ 

mbi->motion_type = MCFIELD; 
dmc = dmcfield; 

vmc = dist2(oldref+(tsel?width.0)+(imint»l)+(width«l)*(jmint»l), 
mb,width« 1 ,imint& 1 jmint& 1 ,8); 

vmc+= dist2(oldref+(bsel?width0)+(iminb» 1 )+(width« 1 )*(jminb» 1 ), 
mb+width,width« 1 ,iminb&l jminb& 1 ,8); 


if (vmc>var && vmc>=9*256) 
mbi->mb_type = MB_INTRA; 
else 
{ 

vO = dist2(oldref+i+width*j,mb,width,0,0,16); 
if (4*vO>5*vmc && vO>=9*256) 

{ 

/* use MC */ 
var = vmc; 

mbi->mb_type = MBFORWARD; 
if (mbi->motion_type=MC_FRAME) 

{ 

mbi->MV[0][0][0] = imin - (i«l); 
mbi->MV[0][0)[l] = jmin - (j«l); 

} 

else /* if (mbi->motion_type=MC_FIELD) *! 

{ 

/* these are FRAME vectors */ 
mbi->MV[0][0][0] = imint - (i«l); 
mbi->MV[0][0][l] = (jmint«l) - (j«l); 
mbi->MV[l][0][0] = iminb - (i«l); 
mbi->MV[l][0][l] = (jminb«l) - (j«l); 
mbi->mv_field_sel[0][0] = tsel; 
mbi->mv_field_sel[l][0] = bsel; 

} 

} 

else 

{ 

/* No-MC */ 
var = vO; 

mbi->mb_type = 0; 
mbi ->motion_type = MCFRAME; 
mbi->MV[0] [0] [0] = 0; 
mbi->MV[0][0][l] =0, 

} 

} 


/* for B_type pictures */ 

if (pict_type=B_TYPE) 

{ 

/* forward prediction */ 
frame_estimate(oldorg,oldref ) mb,ij,sxf,syf. 


&iminf,&jminf,&imintf,&jmintf,&iminbf,&jminbf, 

&dmcf,&dmcfieldf,&tself,&bself,iminsjmins); 

/* backward prediction */ 
firame_estimate(neworg,newref,mb,ij,sxb,syb, 

&iminr,&jminr,&imintr,&jmintr,&iminbr,&jminbr, 

&dmcr,&dmcfieldr,&tselr,&bselr,iminsjmins); 

/* calculate interpolated distance *1 
/* frame */ 

dmci = bdistl(oldref+(iminf»l)+width*(jminf»l), 
newref+(iminr» 1 )+width*(jminr» 1), 
mb,width,iminf& 1 Jminf& 1 ,iminr& 1 Jminr& 1 , 16); 

/* top field 

dmcfieldi = bdist 1 (oldref+(imintf» 1 )+(tself?width:0)+(width« l)*(jmintfi» 1 ), 
newref+(imintr» 1 )+(tselr?width:0)+(width« 1 )*(jmintr» 1 ), 
mb,width«l ,imintf& 1 jmintf& 1 ,inuntr& 1 jmintr& 1 ,8); 

/* bottom field 

dmcfieldi+= bdist 1 (oldref+(iminbf» 1 )+(bself?width:0)+(width« l)*(jminbf» 1 ), 
newref+(iminbr» 1 )+(bselr?width:0)+(width« 1 )*(jminbr» 1 ), 
mb+width,width«l ,iminbf& 1 jminbf& 1 ,iminbr& 1 jminbr& 1,8); 

if (dmci<dmcfieldi && dmci<dmcf && dmci<dmcfieldf 
&& dmci<dmcr && dmci<dmcfieldr) 

{ 

/* frame, interpolated */ 

mbi->mb_type = MB_FORWARD]MB_BACKWARD; 
mbi->motion_type = MC_FRAME; 
vmc = bdist2(oldref+(iminf» l)+width*(jminf» 1 ), 
newref+(iminr» 1 )+width*(jminr»l), 
mb,width,iminf& 1 Jminf& 1 ,iminr& 1 Jminr& 1 , 16); 

} 

else if (dmcfieldi<dmcf && dmcfieldi<dmcfieldf 
&& dmcfieldi<dmcr && dmcfieldi<dmcfieldr) 

{ 

/* field, interpolated */ 

mbi->mb_type = MB_FORWARD|MB_BACKWARD; 
mbi->motion_type = MCFIELD; 

vmc = bdist2(oldref+(imintf»l)+(tself?width:0)+(width«l)*(jmintf»l), 
newref+(imintr» 1 )+(tselr?width:0)+(width« 1 )*(jmintr» 1 ), 
mb,width« 1 ,imintf& 1 jmintf& 1 ,imintr&l jmintr& 1 ,8); 
vmc+= bdist2(oldref+(iminbf» 1 )+(bself?width:0)+(width« 1 )*(jminbf» 1 ), 
newref+(iminbr» 1 )+(bselr?width:0)+(width« l)*(jminbr» 1 ), 
mb+width,width«l ,iminbf& 1 jminbf& 1 ,iminbr& 1 jminbr&l ,8); 

} 

else if (dmcf<dmcfieldf && dmcf<dmcr && dmcf<dmcfieldr) 

{ 

/* frame, forward */ 
mbi->mb_type = MBFORWARD; 
mbi->motion_type = MC_FRAME, 
vmc = dist2(oldref+(iminf»l)+width*(jminf»l),mb, 
width, iminf& 1 jmmf& 1,16); 


} 

else if (dmcfieldf<dmcr && dmcfieldf<dmcfieldr) 

{ 

/* field, forward */ 
mbi->mb_type = MBFORWARD, 
mbi->motion_type = MC_FIELD; 

vmc = dist2(oldref+(tself?width:0)+(imintf»l)+(width«l)*(jmintf»l), 
mb,width« 1 ,imintf& 1 jmintf& 1 ,8); 

vmc+= dist2(oldref+(bself?width:0)+(iminbf»l)+(width«l)*(jminbf»l), 
mb+width,width« 1 ,iminbf& 1 jminbf& 1 ,8); 

} 

else if (dmcr<dmcfieldr) 

{ 

/* frame, backward */ 
mbi->mb_type = MBBACKWARD; 
mbi->motion_type = MC_FRAME; 
vmc = dist2(newref+(iminr»l)+width*(jminr»l),mb, 
width, imi nr& 1 jminr& 1 , 16); 

} 

else 

{ 

/* field, backward */ 
mbi->mb_type = MBBACKWARD, 
mbi->motion_type = MC_FEELD; 

vmc = disL2(ncwref+(Iselr?width:0)+(imintr»l)+(vvidth«l)*(jmintr»l), 
mb,width« 1 ,imintr& 1 jmintr& 1 ,8); 

vmc+= dist2(newref+(bselr?width 0)+(iminbr» 1 )+(width« 1 )*(jminbr» 1 ), 
mb+width, width« 1 ,iminbr& 1 jminbr& 1,8); 

} 

if (vmc>var && vmc>=9*256) 
mbi->mb_type = MBINTRA; 
else 
{ 

var = vmc; 

if (mbi->motion_type=MC_FRAME) 

{ 

/* forward */ 

mbi->MV[0][0][0] = iminf - (i«l); 
mbi->MV[0] [0] [ 1 ] = jminf - (j«l); 

I* backward */ 

mbi->MV[0][l][0] = iminr - (i«l); 
mbi->MV[0][l][l] = jminr - 0 <<: 1); 

} 

else 

{ 

/* these are FRAME vectors */ 

/* forward */ 

mbi->MV[0][0][0] = imintf - (i«l); 
mbi->MV[0][0][l] = (jniintf«l) - (j«l); 
mbi->MV[l][0][0] = iminbf - (i«l); 
mbi->MV[l][0][l] = (jminbf«l) - (j«l); 
mbi->mv _field_sel[0][0] = tself; 
mbi->mv_field_selt 1 ] [0] = bself, 


/* backward */ 

mbi->MV[0][l][0] = imintr - (i«l); 
mbi->MV[0][l][l] = (jmintr«l) - (j«l); 
mbi->MV[l][l][0] = iminbr - (i«l); 
mbi->MV[l][l][l] = (jminbr«l) - (j«l); 
mbi->mv_field_sel[0][l] = tselr; 
mbi->mv_field_sel[l][l] = bselr; 

} 

} 

} 

mbi->var = var; 
mbi++; 

} 

} 


static void frame_estimate(char *org, char *ref, 
char *mb, 
int i, int j, 
int sx, int sy, 
int *iminp, int *jminp, 
int *imintp, int *jmintp, 
int *iminbp, int *jminbp, 
float *dframep, float ♦dfieldp, 
int *tselp, int *bselp, 
int imins[2][2], int jmins[2][2] ) 

{ 

float dt,db,dmint,dminb; 
int imint,iminbjmintjminb; 

/* frame prediction */ 

*dframep = fullsearch(org,ref, mb, width, ij,sx,sy, 16, width, height, 
iminpjminp); 

/* predict top field from top field */ 

dt = fullsearch(org, ref, mb, width«l,ij»l,sx,sy» 1,8, width, height»l, 
&imint,&jmint); 

/* predict top field from bottom field *1 

db = fullsearch(org+width,ref+width,mb,width« 1 ,i j» 1 ,sx,sy» 1 ,8,width,height» 1, 
&iminb,&jminb); 

imins[0][0] = imint; 
jmins[0][0] = jmint; 
imins[l][0] = iminb; 
jmins[l][0] = jminb; 

/* select prediction for top field */ 
if (dt<=db) 

{ 

dmint=dt; *imintp=imint; *jmintp=jmint; *tselp=0; 


} 

else 

{ 

dmint=db, *imintp=iminb; *jmintp=jminb, *tselp=l; 

} 

/* predict bottom field from top field */ 

dt = fullsearch(org,ref,mb+width,width« 1 ,i j» 1 ,sx,sy» 1 ,8,width,height» 1, 
&imint,&jmint); 

/* predict bottom field from bottom field */ 
db = fiillsearch(org+width, ref+width, mb+width, 

width«l, i, j»l, sx, sy»l, 8, width, height»l, 

&iminb,&jminb); 

imins[0][l] = imint; 
jmins[0][l] = jmint; 
imins[l][l] = iminb; 
jmins[ 1 ][ 1 ] = jminb; 

/* select prediction for bottom field *! 
if (db<=dt) 

{ 

dmmb=db; *iminbp=iminb; *jminbp=jminb; *bselp=l; 

} 

else 

{ 

dminb=dt; ♦iminbp=imint; *jminbp=jmint; *bselp=0; 

} 

*dfieldp=dmint+dminb; 

} 


static float fullsearch(char *org, char *ref, 
char *blk, 
int lx, 
int iO, 
int jO, 
int sx, 
intsy, 
char h, 
int xmax, 
int ymax, 
int *iminp, 
int *jminp) 

{ 

int ij,iminjmin,ilow,ihighjlowjhigh; 
float d,dmin; 
int k,l,sxy; 
int ii, jj; 


ilow = iO - sx; 
ihigh = iO + sx; 


if (ilow<0) 
ilow = 0; 


if (ihigh>xmax-16) 
ihigh = xjnax-16; 


jlo\v=jO-sy; 
jhigh = jO + sy; 


if (jlow<0) 
jlow = 0; 


if (jhigh>ymax-h) 
jhigh - ymax-h; 


/* full pel search, spiraling outwards *1 

imin = iO; 
jmin = jO; 

dmin = distl(org+imin+lx*jmin,blk,lx,0,0,h,65536.0); 

sxy = (sx>sy) ? sx : sy; 

for (1=1; l<=sxy; 1++) 

{ 

i = iO - 1; 
j = j0 - 1; 

for (k=0; k<8*l; k++) 

{ 

if (i>=ilow && i<=ihigh && j>=jlow && j<=jhigh) 

{ 

d = distl(org+i+lx*j,blk,lx,0,0,h,dmin); 

if (d<dmin) 

{ 

dmin = d; 
imin = i; 
jmin =j; 

} 

} 

if (k<2*l) i++; 
else if (k<4*l)j++; 
else if (k<6*l) i~; 
else j-; 

} 

} 

/* half pel */ 
dmin = 65536.0; 
imin = imin « 1; 
jmin = jmin « 1; 
ilow = imin - (imin>0); 


ihigh = imin + (imin<((xmax-16)«l)); 

jlow = jmin - (jmin>0); 

jhigh = jmin + (jmin<((ymax-h)«l)); 


for (j=jlow; j<=jhigh; j++) 
for (i=ilow; i<=ihigh; i++) 

{ 

ii = i » 1 ; 

ju=j» i ; 

d = distl(ref+ii+lx*jj,blk,lx,i&l j&l,h,dmin); 

if (d<dmin) 

{ 

dmin = d; 
imin = i; 
jmin = j; 

} 

} 

*iminp = imin; 

*jminp = jmin; 

return dmin; 

} 


static float distl(char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h, 
float distlim ) 

{ 

char *pl,*pla,*p2; 
char i, j; 
int v; 
float s; 

s = 0; 

pi = blkl; 
p2 = blk2; 

if (!hx && !hy) 
for (j=o;j<h;j++) 

{ 

if ((v = pi [0] - p2[0])<0) v = -v, s+= (float) v 
if ((v = p 1 [ 1 ] - p2[l])<0) v = -v; s+= (float) v 
if ((v = pi [2] - p2[2])<0) v = -v; s+= (float) v 
if ((v = pi [3] - p2[3])<0) v = -v; s+= (float) v 
if ((v = pi [4] - p2[4])<0) v = -v; s+= (float) v 
if((v = pl[5] -p2[5])<0) v = -v; s+= (float) v 
if ((v = pi [6] - p2[6])<0) v = -v; s+= (float) v 
if ((v = pi [7] - p2[7])<0) v = -v; s+= (float) v 



if((v = pl[8] -p2[8])<0) v = -v; s+= (float) v; 
if ((v = pi [9] - p2[9])<0) v = -v; s+= (float) v; 
if ((v = pi [10] - p2[10])<0) v = -v; s+= (float) v; 
if ((v = pi [1 1] - p2[l 1])<0) v = -v; s+= (float) v; 
if ((v = pl[12] - p2[12])<0) v = -v; s+= (float) v; 
if ((v = pl[13] - p2[13])<0) v = -v; s+= (float) v; 
if ((v = pi [14] - p2[14])<0) v = -v; s+= (float) v; 
if ((v = pi [15] - p2[15])<0) v = -v; s+= (float) v; 

if (s >= distlim) 
break; 

pl+= lx; 
p2+= lx; 

} 

else if (hx && !hy) 
for (j=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+l)/2-p2[i]); 
if (v>=0) s+= (float) v; 
else s-= (float) v; 

} 

pl+= lx; 
p2+= lx; 

} 

else if (!hx && hy) 

{ 

pla = pi + lx; 
for (j=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int ) ((pl[i]+pla[i]+l) / 2 - p2[i]); 
if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

pi = pla; 

pla+= lx; 
p2+= lx; 

} 

} 

else /* if (hx && hy) */ 

{ 

pla = pi + lx; 
for (j=0;j<h,j++) 

{ 

for(i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+pla[i]+pla[i+l]+2) / 4 - p2[i]); 
if (v>=0) 


s+= (float) v; 
else 

s-= (float) v; 

} 

pi = pi a; 
pIa+= lx; 
p2+= lx; 

} 

} 

return s; 

} 


static float dist2(char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h ) 

{ 

char *pl,*pla, ,, ‘p2; 
chari, j; 
intv; 
float s; 

s = 0; 

pi = blkl; 
p2 = blk2; 
if (!hx && !hy) 
for (j=0, j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = pl[i] -p2[i]; 
s+= (float) (v*v); 

} 

pl+= lx; 
p2+= lx; 

} 

else if (hx && !hy) 
for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+l) / 2 - p2[i]); 
s+= (float) (v*v); 

} 

pl+= lx; 
p2+= lx; 

} 

else if (!hx && hy) 

{ 

pla = pi + lx; 


for O=0;j<h;j-H-) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pla[i]+l) / 2 - p2[i]); 
s+= (float) (v*v); 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 

} 

else /* if (hx && hy) •/ 

{ 

pla = pi + lx; 
for (j=0;j<h;j-H-) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+pla[i]+pla[i+l]+2) / 4 - p2[i]); 
s+= (float) (v*v); 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 


return s; 

} 


static float bdistl(char *pf, char *pb, char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

char *pfa,*pfb > *pfc,*pba,*pbb,*pbc; 
char i,j; 
intv; 
float s; 

pfa = pf + hxf; 
pfb = pf + lx*hyf; 
pfc = pfb + hxf; 

pba = pb + hxb; 
pbb = pb + lx*hyb; 
pbc = pbb + hxb; 

s = 0; 


for (j=0;j<h;j++) 

{ 


for (i=0; i<16; i++) 

{ 

v = (int) (((*pf++ + *pfa++ + *pfb++ + *pfc++ + 2) / 4 + 

(*pb++ + *pba++ + *pbb++ + *pbc++ + 2)/4 + 1 ) / 2 
- *p2++); 

if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

p2+= lx- 16; 
pf+= lx- 16; 
pfa+= lx- 16; 
pfb+= lx-16; 
pfc+= lx-16; 
pb+= lx-16; 
pba+= lx-16; 
pbb+= lx-16; 
pbc+= lx-16; 

} 

return s; 

} 


static float bdist2(char ♦pf, char *pb, char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

char *pfa, *pfb, *pfc, *pba, *pbb, *pbc; 
char i, j; 
int v; 
float s; 


pfa = pf + hxf; 
pfb = pf + lx*hyf; 
pfc = pfb + hxf; 


pba = pb + hxb; 
pbb = pb + lx*hyb; 
pbc = pbb + hxb; 

s = 0; 

for (j=0;j<h;j-H-) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) (((*pf++ + *pfa++ + *pfb++ + *pfc++ + 2) / 4 + 

(*pb++ + *pba++ + *pbb++ + *pbc++ + 2)/4 + l)/2 
- *p2++); 
s+= (float) (v*v); 


} 

p2+= lx- 16, 
pf+= lx-16; 
pfa+= lx-16; 
pfb+= lx-16, 
pfc+= lx-16; 
pb+= lx-16; 
pba+= lx-16; 
pbb+= lx-16; 
pbc+= lx-16; 

} 

return s; 

} 


static float variance(char *p, int lx ) 

{ 

char i, j; 
int v; 

float s, s2; 

s = s2 = 0; 

for (j=0;j<16;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = *p++; 
s+= (float) v; 
s2+= (float) (v*v); 

} 

p+= lx-16; 

} 

s=0.0625*s; 
return s2 - s*s; 

} 


/********************************/ 


/* 


*/ 

/* 

m012.c residing at PE3 

*/ 

/* 


*/ 


/********************************/ 


^include "c:\tc2v2\dos.h" 
^include H c:\tc2v2\float.h n 
^include "c:\tc2v2Vmath.h" 
#include "c:\tc2v2\ctype.h" 
^include "c:\tc2v2\stdio.h" 
^include ,, c:\tc2v2\stdlib.h" 
^include "c:\tc2v2\string. h" 
^include "c:\tc2v2\chan.h" 
#include n c:\tc2v2\thread.h ,, 
tfinclude "c:\tc2v2\time.h" 
#include "c:\tc2v2\ermo h" 


#define GLOBAL /* used by global.h */ 

#include "c:\nick\h\const l.h" 

#include "c:\nick\h\config.h" 

^include M c:\nick\h\global.h M 

static void init _ANSI_ARGS J(void)); 

char *neworg[l]; 

void main(argc,argy,envp,ins,in_ports,outs,out_ports) 
int argc, ins, outs; 
char *argv[],*envp[]; 

CHAN *in_ports[] J *out_ports[], 

{ 

/* read parameter file */ 

chan_in_word(&M,LinkOInput); 

chan_in_word(&nframes,LinkOInput); 

init(), 

putseq(); 


static void init() 

{ 

int 1 , size, a; 

chan_in_word(&mbj\vidth,LinkOInput); 

chan_in_\vord(&mb_height,LinkOInput); 

chanJn_word(&mb_height2,LinkOInput), 

chan_in_word(&vvidth,LinkOInput); 

chan_in_word(&height,LinkOInput), 

chan_in_word(&width2,LinkOInput); 

chan_in_word(&height2,LinkOInput); 


size = width*height ; 


newrefframe[0] = (char *)malloc(size); 
oldre£frame[0] = (char *)malloc(size); 
neworgframefO] = (char *)malloc(size); 
oldorgframe[0] = (char *)malloc(size); 
neworg[0] = (char *)malloc(size); 

mbinfo = (struct mbinfo *)malloc(37*sizeof(struct mbinfo)); 

for (a=0; a<37; a++) 

{ 

mbinfo[a].mb_type = mbinfo[a].motion_type = 0; 

mbinfo[a].MV[0][0][0] = mbinfo[a].MV[0][0][l] = mbinfo[a].MV[0][l][0] = 0; 
mbinfo[a].MV[0][l][l] = mbinfo[a].MV[l][0][0] = mbinfo[a].MV[l][0][l] = 0; 
mbinfo[a].MV[l][l][0] = mbinfo[a].MV[l][l][l] =0; 
mbinfo[a].mv_field_sel[0][0] = mbinfo[a].mv_field_sel[0][l] = 0; 
mbinfo[a].mv_field_sel[l][0] = mbinfo[a].mv_field_sel[l][l] = 0, 

} 


void putseqO 

{ 

/* this routine assumes (N % M) = 0 •/ 
int i, j, k, f, fO, n, np, nb; 
int a; 

int sxf, syf, sxb, syb; 
int size, loop, 

size = width*height; 


/* loop through all frames in encoding/decoding order *! 
do{ 

chan_in_word(&loop,LinkOInput); 

if(loop=nframes) break; 

chan_in_word(&pict_type,LinkOInput); 

chan_in_word(&sxf,LinkOInput), 

chan_in_word(&syf,LinkOInput); 

chan_in_word(&sxb,LinkOInput); 

chan_in_word(&syb,LinkOInput); 

if ( pict_type != I TYPE ) 

{ 

chan_in_message(size,neworg[0],LinkOInput); 
chan_in_message(size,oldorgframe[0],LinkOInput); 
chan_in_message(size,oldrefframe[0],Link01nput); 
if ( pict type != P TYPE ) 

{ 

chan_in_message(size,neworgframe(0],Link01nput); 


chan_in_message(size,newrefframe[0],LinkOInput), 

} 

} 

/* loop through 37 macroblocks of the picture */ 

motion_estimation(oldorgframe[0], neworgframefO], 
oldrefTrame[0], ne\vre£frame[0], 
neworg[0], 
sxf, syf, 
sxb, syb, 
mbinfo); 

for (a=0; a<37; a++) 

{ 

chan_out_word(mbinfo[a] . mb_type,LmkOOutput); 
chan_out_word(mbinfo[a].motion_type,LinkOOutput); 
chanout _word(mbinfo[a] .MV[0] [0] [0],LinkOOutput); 
chan_out_word(mbinfo[a] .MV[0] [0] [ 1 ],LinkOOutput); 
chan_out_word(mbinfo[a] .MV[0] [ 1 ] [0],LinkOOutput); 
chan_out_word(mbinfo[a] .MV[0] [ 1 ] [ l].LinkOOutput); 
chan_out_word(mbinfo[a].MV[ 1 ] [0] [0],LinkOOutput); 
chan_out_word(mbinfo[a] ,MV[ 1] [0] [ 1 ],LinkOOutput); 
chan_out_word(mbinfo[a] ,MV[ 1 ] [ 1 ] (0],LinkOOutput); 
chan_out_word(mbinfo[a] ,MV[ 1 ] [ 1 ] [ 1 ] .LinkOOutput); 
chan_out_word(mbinfo[a].mv_field_sel[0][0],Lmk0Output) 
chan_out_word(mbinfo[a] . mv_field_sel [0] [ 1 ] ,LinkOOutput) 
chan_out_word(mbinfo[a].mv_field_sel[l][0],LinkOOutput) 
chan_out_word(mbinfo[a] . mv_field_sel [ 1 ] [ 1 ] ,LinkOOutput) 
} 

}while(loop<nframes); 

chan_out_word(loop,LinkOOutput); 


} 

static void frame_estimate _ANSI_ARGS_((char *org, 
char *ref, 
char *mb, 
int i, int j, 
int sx, int sy, 
int *iminp, int *jminp, 
int *imintp, int *jmintp, 
int *iminbp, int *jminbp, 
float *dframep, float *dfieldp, 
int *tselp, int *bselp, 
int imins[2][2], int jmins[2][2])); 

static float fullsearch _ANSI_ARGS_((char *org, 
char *ref, 
char *blk, 
int lx, 
int iO, 



intjO, 
int sx, 
int sy, 
char h, 
int xmax, 
int ymax, 
int *iminp, 
int *jminp)); 

static float distl _ANSI_ARGS_((char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h, 

float distlim)); 

static float dist2 _ANSI_ARGS_((char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h)); 

static float bdistl _ANSI_ARGS_((char *pf, 
char *pb, 
char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
int hyb, 
char h)); 

static float bdist2 _ANSI_ARGS_((char *pf, 
char *pb, 
char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
int hyb, 
char h)); 

static float variance _ANSI_ARGS_((char *p, int lx)); 


void motion_estimation(char *oldorg, char *neworg, 
char *oldref, char *newref, 
char *cur, 
int sxf, 
int syf, 
int sxb, 
int syb. 


struct mbinfo *mbi) 


{ 

inti.j; 

int iminjmin,iminfjminf,iminrjminr; 
int imintjmint,iminbjminb; 
int imintf jmintf,iminbf jminbf, 
int imintrjmintr,iminbr jminbr; 
float var,vO; 

float dmc,dmcf,dnicr,dmci,vnic ) vincf,vmcr,vnici; 
float dmcfield.dmcfieldf.dmcfieldr.drncfieldi; 
int tsel,bsel,tself,bself,tselr,bselr, 
char *mb; 

int imins[2][2]jmins[2][2); 
int a, ii, jj; 

/* loop through 37 macroblocks of the picture */ 

for(a=109; a<146; a++) 

{ 

jj = a»4; 
i> = a - (i)«4), 
j=jj«4; 
i = ii«4, 


mb = cur + i + width*j; 


var = variance(mb, width); 

/* for I type pictures */ 

if (pict_type=I_TYPE) 
mbi->mb_type = MBINTRA; 

/* for P type pictures */ 

if (pict_type=P_TYPE) 

{ 

frame_estimate(oldorg,oldref,mb,ij,sxf > syf, 

&imin,&jmin,&imint,&jmint,&iminb,&jminb, 

&dmc,&dmcfield,&tsel,&bsel,iminsjmins); 

/* select between frame and field prediction */ 
if (dmc<=dmcfield) 

{ 

mbi->motion_type = MCFRAME; 
vmc = dist2(oldref+(imin»l)+width*(jmin»l),mb, 
width,imin& 1 jmin& 1 , 1 6); 

} 

else 

{ 

mbi->motion_type = MCFIELD, 
dmc = dmcfteld; 

vmc = dist2(oldref+(tsel?width0)+(imint» 1 )+(width« 1 )*(jmint» 1 ), 
mb,width« 1 ,imint& 1 jmint& 1,8); 


vmc+= dist2(oldref+(bsel?width0)+(iminb» 1 )+(width« 1 )*(jminb» 1 ), 
mb+width,width<< 1 ,iminb& 1 jminb& 1 ,8); 

} 

if (vmOvar && vmc>=9*256) 
mbi->mb_type = MBINTRA; 
else 
{ 

vO = dist2(oldref+i+width*j,mb,width,0,0,16); 
if (4*vO>5*vmc && v0>=9*256) 

{ 

/* use MC */ 
var = vmc; 

mbi->mb_type = MB_FOR W ARD ; 
if (mbi->motion_type— MCJFRAME) 

{ 

mbi->MV[0][0][0] = imin - (i«l); 
mbi->MV[0][0][l] = jmin - (j«l); 

} 

else /* if (mbi->motion_type=MC_FIELD) *1 

{ 

I* these are FRAME vectors */ 
mbi->MV[0][0][0] = imint - (i«l); 
mbi->MV[0][0][l] = (jmint«l) - (jcci); 
mbi->MV[l][0][0] = iminb - (i«l), 
mbi->MV[l][0][l] = (jminb«l) - (j«:l); 
mbi->mv_field_sel[0][0] = tsel; 
mbi->mv_field_sel[l][0] = bsel; 

} 

} 

else 

{ 

/* No-MC */ 
var = vO; 

mbi->mb_type = 0; 
mbi->motion_type = MC_FRAME; 
mbi->MV[0][0][0] = 0, 
mbi->MV[0][0][l] = 0; 

} 

} 

} 


/* for B type pictures */ 

if (pict type=B_TYPE) 

{ 

/* forward prediction */ 
frame_estimate(oldorg,oldref,mb,ij,sxf > syf, 

&iminf,&jminf,&imintf,&jmintf,&iminbf,&jminbf, 

&dmcf,&dmcfieldf,&tself,&bself,iminsjmins); 

/* backward prediction */ 
frame_estimate( neworg, newref, mb, i j, sxb, syb, 


&iminr,&jminr,&imintr,&jmintr,&iminbr,&jminbr ) 

&dmcr,&dmcfieldr,&tselr,&bselr,iminsjmins); 

/* calculate interpolated distance */ 

/* frame */ 

dmci = bdistl(oldref+(iminf»l)+width*(jminf»l), 
newref+(iminr» 1 )+width*(jminr» 1 ), 
mb, width, iminf& 1 jminf& 1 ,iminr& 1 jminr&l , 16); 

/* top field 

dmcfieldi = bdist 1 (oldref+(imintf» 1 )+(tself?width:0)+(width« 1 )*(jmintf» 1 ), 
newref+(imintr» 1 )+(tselr?width:0)+(width« l)*(jmintr» 1 ), 
mb, width« 1 ,imintf& 1 jmintf& 1 ,imintr& 1 jmintr& 1 ,8); 

/* bottom field 

dmcfieldi+= bdist 1 (oldref+(iminbf» 1 )+(bself?widthO)+(width« 1 )*(jntinbf» 1 ), 
newref+(iminbr» 1 )+(bselr?width:0)+(width« 1 )*(jminbr» 1 ), 
mb+width,width«l ,iminbf& 1 jminbf& 1 ,iminbr& 1 jminbr& 1,8); 


if (dmci<dmcfieldi && dmci<dmcf && dmci<dmcfieldf 
&& dmci<dmcr && dmci<dmcfieldr) 

{ 

/* frame, interpolated */ 

mbi->mb_type = MB_FORWARD|MB_BACKWARD; 
mbi->motion_type = MC_FRAME; 
vmc = bdist2(oldref+(iminf» 1 )+width*(jminf» 1), 
newref+(iminr» 1 )+width*(jminr» 1 ), 
mb,width,iminf& 1 jminf&l ,iminr& 1 jminr& 1 , 16); 

} 

else if (dmcfieldi<dmcf && dmcfieldi<dmcfieldf 
&& dmcfieldi<dmcr && dmcfieldi<dmcfieldr) 

{ 

I* field, interpolated *1 

mbi->mb_type = MB_FORWARD|MB_BACKWARD; 
mbi->motion_type = MCFEELD; 

vmc = bdist2(oldref+(imintf» 1 )+(tself?width:0)+(width« 1 )*(jmintf» 1 ), 
newref+(imintr»l)+(tselr?width:0)+(width«l)*(jnuntr»l), 
mb,width« 1 ,imintf& 1 jmintf& 1 ,immtr& 1 jmintr& 1,8); 
vmc+= bdist2(oldref+(iminbf»l)+(bselPwidth:0)+(width«l)*(jnunbf»l), 
newref+(iminbr» 1 )+(bselr?width :0)+(width« 1 )*(jminbr» 1 ), 
mb+width,width«l,iminbf&l jminbf&l,iminbr&l jminbr&l,8); 

} 

else if (dmcf<dmcfieldf && dmcf<dmcr && dmcf<dmcfieldr) 

{ 

/* frame, forward */ 
mbi->mb_type = MBFORWARD; 
mbi->motion_type = MCFRAME; 
vmc = dist2(oldref+(iminf»l)+width*(jminf»l),mb, 
width,iminf& 1 jminf& 1 , 1 6); 

} 

else if (dmcfieldf<dmcr && dmcfieldf<dmcfieldr) 

{ 

I* field, forward */ 



mbi->mb_type = MBFORWARD; 
mbi->motion_type = MC_FIELD; 

vmc = dist2(oldref+(tseU?\vidth.0)+(imintf» 1 )+(width« 1 )*(jmintf» 1 ), 
mb,width« 1 ,imintf& 1 jmintf& 1 ,8); 

vmc+= dist2(oldref+(bself?width:0)+(inunbf»l)+(width«l)*(jininbf»l), 
mb+widlh,width« 1 ,iminbf& 1 jminbf& 1 ,8); 

} 

else if (dmcr<dmcfteldr) 

{ 

/* frame, backward */ 
mbi->mb_type = MB_BACKWARD; 
mbi->motion_type = MC_FRAME, 
vmc = dist2(newref+(iminr»l)+width*(jminr»l),mb, 
width, iminr& 1 Jminr& 1 , 1 6); 

} 

else 

{ 

/* field, backward */ 
mbi->mb_type = MB_BACKWARD; 
mbi->motion_type = MCFEELD; 

vmc = dist2(newref+(tselr?width0)+(imintr>>l)+(width<<l)*(jmintr>>l), 
mb,width« 1 ,imi ntr& 1 jmintr& 1,8); 

vmc+= dist2(newref+(bselr?width:0)+(iminbr» 1 )+(width«l)*(jminbr» 1 ), 
mb+width,width« 1 ,iminbr& 1 jminbr& 1,8); 

} 

if (vmc>var && vmc>=9*256) 
mbi->mb_type = MBINTRA; 
else 
{ 

var = vmc; 

if (mbi->motion_type=MC_FRAME) 

{ 

/* forward */ 

mbi->MV[0][0][0] = iminf - (i«l), 
mbi->MV[0][0][l] = jminf - (j«l); 

/* backward */ 

mbi->MV[0][l][0] = iminr - (i«l); 
mbi->MV[0][l][l] = jminr - (j <<: l); 

} 

else 

{ 

/* these are FRAME vectors */ 

/* forward */ 

mbi->MV[0][0][0] = imintf - (i«l); 
mbi->MV[0][0][l] = (jmintf«l) - 0 <<: 1); 
mbi->MV[l][0][0] = iminbf - (i«l); 
mbi->MV[l][0][l] = (jminbf«l) - (j <<: l); 
mbi->mv_field_sel[0][0] = tself; 
mbi->mv_field_sel[l][0] = bself; 

/* backward *1 

mbi->MV[0][l][0] = imintr - (i«l); 
mbi->MV[0][l][l] = (jmintr«l) - (j«l); 
mbi->MV[l][l][0] = iminbr - (i«l); 


mbi->MV[I][l][l] = (jminbr«l) - (j«l); 
mbi->mv_field_sel[OJ[l] = tselr; 
mbi->mv_field_sel[l][l] = bselr; 

} 

} 

} 

mbi->var = var; 
mbi++; 


} 

> 


static void frame_estimate(char *org, char *ref, 
char *mb, 
int i, int j, 
int sx, int sy, 
int *iminp, int *jminp, 
int *imintp, int *jmintp, 
int *iminbp, int *jminbp, 
float *dframep, float *dfieldp, 
int ♦tselp, int *bselp, 
int imins[2][2], int jmins[2][2] ) 

{ 

float dt,db.dmint,dminb; 
int imint,iminbjmintjminb; 

/* frame prediction */ 

*dframep = fiillsearch(org,ref,mb,width,ij,sx,sy,16,width,height, 
iminpjminp); 

/* predict top field from top field */ 

dt = fullsearch(org,ref,mb,width« 1 ,i j» 1 ,sx,sy» 1 ,8,width,height» 1 , 
&imint,&jmint); 

/* predict top field from bottom field */ 

db = fullsearch(org+width,ref+\vidth,mb,width« 1 ,i j» 1 ,sx,sy» 1 ,8, width, height» 1 , 
&iminb,&jminb); 

imins[0][0] = imint; 
jmins[0][0] = jmint, 
imins[l][0] = iminb; 
jmins[l][0] = jminb; 

/* select prediction for top field */ 
if(dt<=db) 

{ 

dmint=dt; *imintp=imint; *jmintp=jmint; *tselp=0; 

} 

else 

{ 

dmint=db; *imintp=iminb, *jmintp=jminb; *tselp=l; 


} 


/* predict bottom field from top field */ 

dt = fullsearch(org,ref,mb+width,width« 1 ,i j» 1 ,sx,sy» 1 ,8,width,height» 1 , 
&imint,&jmint); 

/* predict bottom field from bottom field */ 
db = fullsearch(org+width, ref+width, mb+width, 

width«l, i, j»l, sx, sy»l, 8, width, height»l, 

&iminb,&jminb); 

imins[0][l] = imint; 
jmins[0][l] =jmint; 
imins[l][l] = iminb; 
jmins[l][l] =jminb; 

/* select prediction for bottom field */ 
if (db<=dt) 

{ 

dminb=db; *iminbp=iminb; *jminbp=jminb; *bselp=l; 

} 

else 

{ 

dminb=dt, *iminbp=imint, *jminbp=jmint; *bselp=0; 

} 

*dfieldp=dmint+dminb; 

} 


static float fullsearch(char *org, char *ref, 
char *blk, 
int lx, 
int iO, 
int jO, 
int sx, 
int sy, 
char h, 
int xmax, 
int ymax, 
int *iminp, 
int *jminp) 

{ 

int ij,iminjmin,ilow,ihighjlowjhigh; 
float d,dmin; 
int k,l,sxy; 
int ii, jj; 


ilow = iO - sx, 
ihigh = iO + sx; 

if (ilow<0) ilow = 0; 

if (ihigh>xmax-16) ihigh = xmax-16; 


jlow = jO - sy, 
jhigh =jO + sy, 

if (jlow<0) jlow = 0; 

if (jhigh>ymax-h) jhigh = ymax-h; 

/* full pel search, spiraling outwards */ 

imin = iO; 
jmin = jO; 

dmin = distl(org+imin+lx*jmin,blk,lx,0,0,h, 65536.0); 

sxy = (sx>sy) ? sx : sy; 

for (1=1; l<=sxy; 1++) 

{ 

i = iO - 1; 

j = j° - U 

for (k=0, k<8*l; k++) 

{ 

if (i>=ilow && i<=ihigh && j>=jlow && j<=jhigh) 

{ 

d = distl(org+i+lx*j,blk,lx,0,0,h,dmin), 

if (d<dmin) 

{ 

dmin = d; 
imin = i; 
jmin = j; 

} 

} 

if (k<2*l) i++; 
else if (k<4*l) j++; 
else if (k<6*l) i--; 
else j--; 

} 

} 

/* half pel */ 

dmin = 65536.0; 

imin = imin « 1; 

jmin = jmin « 1; 

ilow = imin - (imin>0); 

ihigh = imin + (imin<((xmax-16)«l)); 

jlow = jmin - (jmin>0); 

jhigh = jmin + (jmin<((ymax-h)«l)); 

for (j=jlow; j<=jhigh; j++) 
for (i=ilow; i<=ihigh; i++) 

{ 

ii = i » 1 ; 


jj=j» i ; 

d = distl(ref+ii+lx*jj,blk,lx,i&l j&l,h,dmin); 

if (d<dnun) 

{ 

dmin = d; 
imin = i; 
jmin =j; 

} 

} 

*iminp = imin, 

*jminp = jmin, 

return dmin; 

} 

static float distl(char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h, 
float distlim ) 

{ 

char *pl,*pla,*p2; 
char i, j; 
intv; 
float s; 

s = 0; 
pi = blkl; 
p2 = blk2; 

if (!hx && !hy) 
for 0=0; j<h;j++) 

{ 

if ((v = pi [0] - p2[0])<0) v = -v; s+= (float) v; 
if((v = pl[l] -p2[l])<0) v = -v; s+= (float) v; 
if ((v = pi [2] - p2[2])<0) v = -v; s+= (float) v; 
if ((v = pi [3] - p2[3])<0) v = -v; s+= (float) v; 
if ((v = pi [4] - p2[4])<0) v = -v; s+= (float) v; 

if ((v = pi [5] -p2[5])<0) v = -v; s+= (float) v; 

if ((v = pi [6] - p2[6])<0) v = -v; s+= (float) v; 

if ((v = pi [7] - p2[7])<0) v = -v; s+= (float) v; 

if ((v = pi [8] - p2[8])<0) v = -v; s+= (float) v; 

if((v = pl[9] -p2[9])<0) v = -v; s+= (float) v; 
if ((v = pl[10] - p2[10])<0) v = -v; s+= (float) v; 
if ((v = pi [1 1] - p2[l 1])<0) v = -v; s+= (float) v; 
if ((v = pl[12) - p2[12])<0) v = -v; s+= (float) v; 
if ((v = pi [13] - p2[13])<0) v = -v; s+= (float) v; 
if ((v = pi [14] - p2[14])<0) v = -v; s+= (float) v; 
if ((v = pl[15] - p2[15])<0) v = -v; s+= (float) v; 


if (s >= distlim) 
break; 


pl+= lx; 
p2+= lx; 

} 

else if (hx && !hy) 
for (j=0; j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+l) / 2 - p2[i]); 
if (v>=0) s+= (float) v; 
else s-= (float) v; 

} 

pl+= lx; 
p2+= lx; 

} 

else if (!hx && hy) 

{ 

pla = pi + lx; 
for (j=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

v = (int ) ((pl[i]+pla[i]+l) / 2 - p2[i]), 
if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 

} 

else /* if (hx && hy) *1 

{ 

pla = pi + lx; 
for (j=0,j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+pla[i]+pla[i+l]+2) / 4 - p2[i]); 
if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 

} 


return s; 

} 

static float dist2(char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h ) 

{ 

char *pl,*pla,*p2; 
char ij; 
int v; 
float s; 

s = 0; 

pi = blKl; 
p2 = blk2; 
if (!hx && !hy) 
for (j=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = pl[i] - P 2[i]; 
s+= (float) (v*v); 

} 

pl+= lx; 
p2+= lx; 

} 

else if (hx && !hy) 
for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+l) / 2 - p2[i]); 
s+= (float) (v*v), 

} 

pl+= lx; 
p2+= lx; 

} 

else if (!hx && hy) 

{ 

pla = pi + lx; 
for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pla[i]+l) / 2 - p2[i]); 
s+= (float) (v*v); 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 


} 

} 

else /* if (hx && hy) */ 

{ 

pla = pi + lx; 
for (j=0;j<h;j-H-) 

{ 

for (i=0, i<16; i++) 

{ 

v = (int) ((pi [i]+pl [i+l]+pla[i]+pla[i+l]+2) / 4 - p2[i]); 
s+= (float) (v*v); 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 

} 

return s; 

} 

static float bdistl(char *pf, char *pb, char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

char *pfa,*pfb,*pfc,*pba,*pbb,*pbc; 
char i, j; 
int v; 
float s; 

pfa = pf + hxf; 
pfb = pf + lx*hyf; 
pfc = pfb + hxf; 

pba = pb + hxb; 
pbb = pb + lx*hyb; 
pbc = pbb + hxb; 

s = 0; 

for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) (((*pf++ + *pfa++ + *pfb++ + *pfc++ + 2) / 4 + 

(*pb++ + *pba++ + *pbb-H- + *pbc++ + 2) / 4 + 1) / 2 

- *p2++); 

if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 


} 


p2+= lx- 16; 
pf+= lx-16, 
pfa+= lx-16; 
pfb+= lx-16; 
pfc+= lx-16; 
pb+= lx-16; 
pba+= lx-16; 
pbb+= lx-16; 
pbc+= lx-16; 

} 

return s; 

} 

static float bdist2(char *pf, char *pb, char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

char *pfa,*pfb,*pfc,*pba,*pbb,*pbc; 
char ij; 
intv, 
float s; 

pfa = pf + hxf; 
pfb = pf + lx*hyf; 
pfc = pfb + hxf; 

pba = pb + hxb; 
pbb = pb + lx*hyb; 
pbc = pbb + hxb; 

s = 0; 

for (j=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) (((*pf++ + *pfa++ + *pfb++ + *pfc++ + 2) / 4 + 

(*pb++ + *pba++ + *pbb++ + ♦pbc++ + 2) / 4 + 1) / 2 

- *p2++); 

s+= (float) (v*v); 

} 

p2+= lx-16; 
pf+= lx-16; 
pfa+= lx-16; 
pfb+= lx-16; 
pfc+= lx-16; 
pb+= lx-16; 
pba+= lx-16; 
pbb+= lx-16; 
pbc+= lx-16; 

} 


return s; 

} 

static float vanance(char *p. int lx ) 

{ 

char i, j; 
int v; 
float s, s2; 

s = s2 = 0; 

for (j=0J<16;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = *p++; 
s+= (float) v; 
s2+= (float) (v*v); 

} 

p+= lx- 16; 

} 

s=0.0625*s; 
return s2 - s*s; 

} 



/********************************/ 
/* *1 

I* m02.c residing at PE4 */ 

/* •/ 

/********************************/ 


^include "c:\tc2v2\dos.h" 

#include "c:\tc2v2\float.h" 

^include "c:\tc2v2\math.h" 

^include "c:\tc2v2\ctype.h" 

^include "c:\tc2v2\stdio.h" 

^include "c:\tc2v2\stdlib.h" 

^include "c:\tc2v2\string.h" 

^include "c:\tc2v2\chan.h" 

^include ''c:\tc2v2\thread.h" 

#include "c:\tc2v2\time.h" 

#include "c:\tc2v2\ermo.h” 

^define GLOBAL /* used by global.h */ 

#include "c:\nick\h\constl.h" 

^include "c:\nick\h\config.h" 

^include "c:\nick\h\global.h" 

static void init _ANSI_ARGS_((void)); 

char *neworg[l]; 

void main(argc,argv,envp, ins, in_ports, outs, out _ports) 
int argc,ins,outs; 
char *argv[],*envp[]; 

CHAN *in_ports[],*out_ports[]; 

( 

/* read parameter file */ 
chan_in_word(&M,LinkOInput); 
chan_out_word(M,LinklOutput); 
chan_outvvord(M,Link2 Output); 

chan_in_word(&nframes,LinkOInput); 
chan_out_word(nframes,Link 1 Output); 
chan_out_word(nfirames,Link20utput); 

init(), 

putseqO; 


} 

static void init() 

{ 

int i, size, a; 

chan_in_word(&mb_width,LinkOInput); 
chan_out_word(mb_width, Link 1 Output); 
chan_out_word(mb_width,Link20utput); 


chan_in_word(&mb_height,LinkOInput); 

chan_out_word(mb_height,LinklOutput), 

chan_out_word(mb_height,Link20utput); 

chan_in_word(&mb_height2,LinkOInput); 

chan_out_word(mb_height2,LinklOutput); 

chan_out_word(mb_height2,Link20utput); 

chan_in_word(&width,LinkOInput); 
chan_out_word(width,Link 1 Output); 
chan_out_word(width,Link20utput); 

chan_in_word(&height,LinkOInput); 
chan_out_word(height,Link 1 Output); 
chan_out_word(height,Link20utput); 

chan_in_word(&width2,LinkOInput); 

chan_out_word(width2,LinklOutput); 

chan_out_word(width2,Link20utput); 

chan_in_word(&height2,LinkOInput); 
chan_out_word(height2,Link 1 Output); 
chan_out_word(height2,Link20utput); 

size = width*height ; 

newreflFrame[0] = (char *)malloc(size); 
oldrefframefO] = (char *)malIoc(size); 
neworgframe[0] = (char *)malloc(size); 
oldorgframe[0] = (char *)malloc(size); 
neworg[0] = (char *)malloc(size); 

mbinfo = (struct mbinfo "‘)malloc(l 10*sizeof(struct mbinfo)); 

for (a=0; a<l 10; a++) 

{ 

mbinfo[a].mb_type = mbinfo[a].motion_type = 0; 

mbinfo[a] .MV[0] [0] [0] = mbinfo[a].MV[0)[0]tl] = mbinfo[a].MV[0][l][0] = 0; 
mbinfo[a].MV[0][l][l] = mbinfo[a] MV[1][0][0] = mbinfo(a].MV[l][0][l] = 0; 
mbinfo[a] . MV[ 1 ] [ 1 ] [0] = mbinfo[a] MV[1)[1][1] = 0; 
mbinfo[a].mv_field_sel[0][0] = mbinfo[a].mv_field_sel[0][l] = 0; 
mbinfo[a].mv_fie!d_sel[l][0] = mbinfo[al.mv_field_seI[l]flJ = 0; 

} 


void putseqO 

{ 

/* this routine assumes (N % M) = 0 */ 
int i, j, k, f, fO, n, np, nb; 
int a; 



int s xf, syf, sxb, syb; 
int size, loop, 

size = width*height; 


/* loop through all frames in encoding/decoding order */ 
do{ 

chan_in_word(&loop,LinkOInput); 
chan_out_word(loop,Link 1 Output); 
chan_out_word(loop,Link20utput); 

if(loop=nframes) break; 

chan_in_word(&pict_type,LinkOInput); 

chan_out_word(pict_type,LinklOutput); 

chan_out_word(pict_type,Link20utput); 

chan_in_word(&sxf,LinkOInput); 
chan_out _word(sxf,Link 1 Output); 
chan_out_word(sxf,Link20utput); 

chan_in_word(&syf,LinkOInput); 
chan_out_word(syf,Link 1 Output); 
chan_out_word(syf,Link20utput); 

chan_in_word(&sxb,LinkOInput); 
chan_out_word(sxb,Link 1 Output); 
chan_out_word(sxb,Link20utput); 

chan_in_word(&syb,LinkOInput); 
chan_out_word(syb,Link 1 Output); 
chan_out_word(syb,Link20utput); 

if ( pict type != 1TYPE ) 

{ 

chan_in_message(size,neworg[0],LinkOInput); 

chan_out_message(size,neworg[0],LinklOutput); 

chan_out_message(size,neworg[0],Link20utput); 

chan_in_message(size,oldorgframe[OJ,LinkOInput); 

chan_out_message(size,oldorgframe[0],LinklOutput); 

chan_out_message(size,oldorgframe[0],Link2Output); 

chan_in_message(size,oldrefframe[0],LinkOInput); 
chan_out_message(size,oldre£ffame[0],LinklOutput); 
chan_out_message(size,oldrefframe[0],Link2 Output); 

if ( pict type != P TYPE ) 

{ 

chan_in_message(size,neworgframe[0],LinkOInput); 
chan_out_message(size,ncworgframe[0], Link 1 Output); 
chan_out_message(size,neworgffame[0],Link20utput); 


chan_in_message(size,newrefframe[0],LinkOInput), 
chanout_message(size. ne\vrefframe[0], Link 1 Output); 
chanout_message(size.newrcfTranie[0],Link2 Output); 

> 

} 

/* loop through 36 macroblocks of the picture */ 

motion_estimation(oldorgframe[0], neworgframe[0], 
oldrefframe[0], newrefframe[0], 
neworgfO], 
sxf, syf, 
sxb, syb, 
mbinfo); 

for (a=36; a<73; a++) 

{ 

chan_in_word(&mbinfo[a].mb_type,LinklInput); 
chan_in_word(&mbinfo[a] motiontype.Linkllnput); 
chan_in_word(&mbinfo[a].MV[0] [0] [0],Link 1 Input); 
chan_in_word(&mbinfo[a] .MV[0] [0] [ 1 ] ,Link 1 Input); 
chan_in_word(&mbinfo[a] MV[0] [ 1 ] [0],Link 1 Input); 
chan_in_word(&mbinfo[a] .MV[0] [ 1 ] [ 1 ],Link 1 Input); 
chan_in _word(&mbinfo[a].MV[ 1 ] [0] [0],Link llnput); 
chan _in_word(&mbinfo[a].MV[l ][0][1 ],Linkl Input), 
chan_in_word(&mbmfo[a].MV[l][l]lO],Linkllnput); 
chan_in_word(&mbinfo[aJ.MV[ 1 ] [ 1 ] [ 1 ],Link 1 Input); 
chan_in_word(&mbinfo[a].mv_field_sel[0][0],LinklInput); 
chan_in_word(&mbinfo[a] . mv_field_sel [0] [ 1 ],Link 1 Input); 
chan _in_word(&mbinfo[a] . mv_field_sel [ 1 ] [0] .Link 1 Input); 
chan_in_word(&mbinfo[a] . mv_field_sel [ 1 ] [ 1 ] .Link 1 Input); 
} 

for (a=73; a<l 10; a++) 

{ 

chan_in _word(&mbinfo[a] . mb_type,Link2Input); 
chan_in_word(&mbinfo[a].motion_type,Link2Input); 
chan_in_word(&mbinfo[a].MV[0][0][0],Link2Input); 
chan_in_word(&mbinfo[a] .MV[0J [0] [ 1 ],Link2Input); 
chan_in_word(&mbinfo[a] .MV[0] ( 1 ] [0] ,Link2Input); 
chan_in_word(&mbinfo(a] MV[0]f 1 J [ 1 ],Link2Input); 
chan_in_\vord(&mbinfo[a].MV[ 1 ] [0] [0],Link2Input); 
chan_in_word(&mbinfo[a] .MV[ 1 ) [0] [ 1 ],Link2Input); 
chan_in_word(&mbinfo[a] ,MV[ 1 ] [ 1 ] [0] ,Link21nput); 
chan_in_word(&mbinfo[a].MV[l 1[1 ] [ l],Link2Input); 
chan_in_word(&mbinfo[a].mv_field_sel[0][0],Link2Input); 
chan_in_word(&mbinfo[a] ,mv_field_sel [0] [ 1 ] ,Link2Input); 
chan_in_word(&mbinfo[a] ,mv_field_sel[ 1 ] [0],Link2 Input); 
chan_in_word(&mbinfo[a].mv_field_sel[l][l],Link2Input); 
} 


for (a=0; a<l 10; a+-f-) 

{ 


chan_out_word(mbinfo[a].mb_type, LinkOOutput); 
chan_out_word(mbinfo[a] motionjype, LinkOOutput); 
chan_out_word(mbinfo[a].MV[0][0][0], LinkOOutput); 
chan_out_word(mbinfo[a].MV[0][0][l],Link0Output); 
chan_out_word(mbinfo[a] .MV[0] [ 1] [0], LinkOOutput); 
chan_out_word(mbinfo[a] .MV[0] [ 1 ] [ 1 ],LinkOOutput); 
chan_out_word(mbinfo[a] .MV[ 1 ] [0] [0], LinkOOutput); 
chan_out_word(mbinfo[a] .MV[ 1 ] [0] [ 1 ] .LinkOOutput); 
chan_out_word(mbinfo[a].MV[ 1 ] [ 1 ] [0], LinkOOutput); 
chan_out_word(mbinfo[a] . MV[ 1 ] [ 1 ] [ 1 ], LinkOOutput); 
chan_out_word(mbinfo[a].mv_field_sel[0][0], LinkOOutput); 
chan_out_word(mbinfo[a].mv_field_sel[0][l], LinkOOutput); 
chan_out_word(mbinfo[a] . mv_field_sel [ 1 ] [0] .LinkOOutput); 
chan_out_word(mbinfo[a] . mv_field_sel [ 1 ] [ 1 ] .LinkOOutput); 
} 

}while(loop<nframes); 

chan_in_word(&loop,Link 1 Input); 
chan_in_word(&loop,Link2Input); 
chan_out_word(loop, LinkOOutput); 

}/*end of main()*/ 


static void frame estimate _ANSI_ARGS_((char *org, 
char *ref, 
char *mb, 
int i, int j, 
int sx, int sy, 
int *iminp, int *jminp, 
int *imintp, int *jmintp, 
int *iminbp, int *jminbp, 
float *dframep, float *dfieldp, 
int *tselp, int *bselp, 
int imins[2][2], int jmins[2](2])); 

static float fullsearch _ANSI_ARGS_((char ♦org, 
char *ref, 
char *blk, 
int lx, 
int iO, 
int jO, 
int sx, 
int sy, 
char h, 
int xmax, 
int ymax, 
int *iminp, 
int *jminp)); 

static float distl _ANSI_ARGS_((char *blkl, 
char *blk2, 
int lx. 


char hx, 
char hy, 
char h, 

float distlim)); 

static float dist2 _ANSI_ARGS_((char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h)); 

static float bdistl _ANSI_ARGS_((char *pf, 
char *pb, 
char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
int hyb, 
char h)); 

static float bdist2 _ANSl_ARGS_((char *pf, 
char *pb, 
char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
int hyb, 
char h)); 

static float variance _ANSI_ARGS_((char *p, int lx)); 


void motion_estimation(char *oldorg, char *neworg, 
char *oldref, char *newref, 
char *cur, 
int sxf, 
int syf, 
int sxb, 
int syb, 

struct mbinfo *mbi) 

{ 

int i, j; 

int iminjmin.iminfjminf iminrjminr; 
int imintjmint,iminbjminb; 
int imintfjmintf.iminbfjminbf; 
int imintrjmintr,iminbrjminbr; 
float var.vO, 

float dmc,dmcf,dmcr,dmci,vmc,vmcf,vmcr,vnici; 
float dmcfleld,dmcfieldf,dmcfieldr,dmcfleldi; 
int tsel,bsel,tself,bself,tselr,bselr; 
char *mb; 


int imins[2][2] jmins[2][2]; 
int a, ii, jj; 

/* loop through 36 macroblocks of the picture */ 

for(a=146; a<182; a++) 

{ 

jj = a »4; 
ii = a - (jj«4); 
j *=jj«4; 
i = ii«4; 

mb = cur + i + width*j; 

var = variance(mb, width); 

/* for I_type pictures *1 

if (pict_type=I_TYPE) 
mbi->mb_type = MBINTRA; 

/* for P_type pictures */ 

if (pict_type=P_TYPE) 

{ 

frame_estimate(oldorg.oldref,mb,ij,sxf,syf, 

&imin,&jmin,&imint,&jmint,&iminb,&jminb, 

&dmc,&dmcfield,&tsel,&bsel,inunsjmins); 

/* select between frame and field prediction *1 
if (dmc<=dmcfield) 

{ 

mbi->motion_type = MC_FRAME; 
vmc = dist2(oldref+(imin»l)+width*(jmin»l),mb, 
width, imin& 1 jmin& 1,16); 

> 

else 

{ 

mbi->motion_type = MC_FIELD; 
dmc = dmcfield; 

vmc = dist2(oldref+(tsel?width:0)+(imint»l)+(width«l)*(jmint»l), 
mb,width«l ,imint& 1 jmint& 1 ,8); 

vmc+= dist2(oldref+(bsel?width.0)+(iminb» 1 )+(width« 1 )*(jminb» 1 ), 
mb+ width, width« 1 ,iminb& 1 jminb& 1,8); 

} 

if (vmc>var && vmc>=9*256) 
mbi->mb_type = MBINTRA; 
else 
{ 

vO = dist2(oldref+i+width*j,mb,width,0,0,16); 
if (4*v0>5*vmc && v0>=9*256) 

{ 

I* use MC */ 


var = vmc; 

mbi->mb_type = MB_FORWARD; 
if (mbi->motion_type=MC_FRAME) 

{ 

mbi->MV[0][0][0] = imin - (i«l), 
mbi->MV[0][0][l] = jmin - (j«l); 

} 

else /* if (mbi->motion_type=MC_FIELD) */ 

{ 

/* these are FRAME vectors */ 
mbi->MV[0][0][0] = imint - (i«l); 
mbi->MV[0][0][l] = (jmint«l) - (j«l); 
mbi->MV[l][0][0] = iminb - (i«l); 
mbi->MV[l][0][l] = (jminb«l) - (j«l); 
mbi->mv_field_sel[0][0] = tsel; 
mbi->mv_field_sel[l][0] = bsel; 

} 

} 

else 

{ 

/* No-MC */ 
var = vO; 

mbi->mb_type = 0, 
mbi->motion_type = MCFRAME; 
mbi->MV[0] [0] [0] = 0; 
mbi->MV[0][0][l] = 0; 

} 

} 

} 


/* for B_type pictures *1 

if (pict type=B_TYPE) 

{ 

/* forward prediction */ 
ffame_estimate(oldorg,oldref,mb,ij > sxf,syf, 

&inunf,&jnunf,&imintf,&jmintf,&iminbf > &jminbf ) 

&dmcf,&dmcfieldf,&tself,&bself,iminsjinins); 

/* backward prediction */ 
frame_estimate(neworg,newref,mb,ij,sxb,syb, 

&iminr,&jminr,&imintr,&jmintr,«feiminbr,&jminbr, 

&dmcr,&dmcfieldr,&tselr,&bselr,iminsjmins); 

t* calculate interpolated distance */ 

/* frame */ 

dmci = bdist 1 (oldref+(iminf» 1 )+width*(jminf» 1 ), 
newref+(iminr» 1 )+width*(jminr» 1), 
mb, width, iminf& 1 jminf& 1 ,iminr& 1 jminr& 1,16); 

/* top field 

dmcfieldi = bdistl(oldref+(imintf»l)+(tself?width:0)+(width«l)*(jmintf»l), 
newref+(imintr» 1 )+(tselr?width:0)+(width« l)*(jmintr» 1), 


mb,width« 1 ,imintf& 1 jmintf& 1 ,imintr& 1 jmintr& 1,8); 

/* bottom field 

dmcfieldi+= bdist 1 (oldref+(iminbf» 1 )+(bself?width:0)+(width« 1 )*(jminbf» 1 ), 
newref+(iminbr» 1 )+(bselr?width:0)+(\vidth« l)*(jniinbr» 1 ), 
mb+width,width« 1 ,iminbf& 1 jmmbf& 1 ,iminbr& 1 jminbr& 1 ,8); 


if (dmci<dmcfieldi && dmci<dmcf && dmci<dmcfieldf 
&& dmci<dmcr && dmci<dmcfieldr) 

{ 

/* frame, interpolated */ 

mbi->mb_type = MB_FORWARD|MB_BACKWARD; 
mbi->motion_type = MC_FRAME; 
vmc = bdist2(oldref+(iminf» l)+\vidth*(jminf» 1 ), 
newref+(iminr» 1 )+width*(jminr» 1), 
mb, width, iminf& 1 Jminf& 1 ,iminr& 1 Jminr& 1,16); 

} 

else if (dmcfieldi<dmcf && dmcfieldi<dmcfieldf 
&& dmcfieldi<dmcr && dmcfieldi<dmcfieldr) 

{ 

/* field, interpolated */ 

mbi->mb_type = MB_FORWARD|MB_BACKWARD; 
mbi->motion_type = MC_FEELD; 

vmc = bdist2(oldref+(imintf»l)+(tself?width:0)+(width«l)*(jtnintf»l), 
newref+(imintr» 1 )+(tselr?width:0)+(width« 1 )*(jmintr» 1 ), 
mb,width« l,imintf& 1 jmintf& 1 ,imintr& 1 jmintr& 1 ,8); 
vmc+= bdist2(oldref+(iminbf» 1 )+(bself?width:0)+(width« 1 )*(jminbf» 1 ), 
newref+(iminbr» 1 )+(bselr?width:0)+(width« 1 )*(jminbr» 1 ), 
mb+width,width« 1 ,iminbf& 1 jminbf&l ,iminbr& 1 jminbr& 1,8); 

} 

else if (dmcf<dmcfieldf && dmcf<dmcr && dmcf<dmcfieldr) 

{ 

/* frame, forward */ 
mbi->mb_type = MBFORWARD; 
mbi->motion_type = MC_FRAME; 
vmc = dist2(oldref+(iminf»l)+width*(jminf»l),mb, 
width, imi nf& 1 jminf& 1, 16); 

} 

else if (dmcfieldf<dmcr && dmcfieldf<dmcfieldr) 

{ 

/* field, forward *1 
mbi->mb_type = MBFORWARD; 
mbi->motion_type = MCFLELD; 

vmc = dist2(oldref+(tself?width:0)+(imintf» 1 )+(width« 1 )*(jmintf» 1 ), 
mb,width« 1 ,imintf& 1 jmintf& 1 ,8); 

vmc+= dist2(oldref+(bself?width:0)+(iminbf» 1 )+(width« 1 )*(jminbf» 1 ), 
mb+width,width« 1 ,iminbf& 1 jminbf& 1,8); 

} 

else if (dmcr<dmcfieldr) 

{ 

/* frame, backward */ 
mbi->mb_type = MB_BACKWARD, 
mbi->motion_type = MCFRAMH; 


vmc = disl2(newref+(iminr»l)+width*(jminr»l),mb, 
width,iminr& 1 jminr& 1 , 16); 

} 

else 

{ 

/* field, backward */ 
mbi->mb_type = MBBACKWARD; 
mbi->motion_type = MCFEELD, 

vmc = dist2(newref+(tselr?width:0)+(imintr»l)+(width«l)*(jmintr»l), 
mb,width« 1 ,imintr& 1 jmintr& 1,8); 

vmc+= dist2(newref+(bselr?width:0)+(iminbr»l)+(width«l)*(jminbr»l), 
mb+width,width« 1 ,iminbr& 1 jminbr& 1,8); 


if (vmc>var && vmc>=9*256) 
mbi->mb_type = MBINTRA; 
else 
{ 

var = vmc; 

if (mbi->motion_type=MC_FRAME) 

{ 

/* forward */ 

mbi->MV[0][0][0] = iminf - (i«l); 
mbi->MV[0][0][l] = jminf - (j«l); 

/* backward */ 

mbi->MV[0][l][0] = iminr - (i«l); 
mbi->MV[0][l][l] = jminr - (j«l); 

} 

else 

{ 

/* these are FRAME vectors */ 

/* forward */ 

mbi->MV[0][0][0] = imintf - (i«l); 
mbi->MV[0][0][l] = (jmintf«l) - (j <<: l), 
mbi->MV[l][0][0] = iminbf - (i«l); 
mbi->MV[l][0][l] = (jminbf«l) - <j <<: l); 
mbi->mv_field_sel[0][0] = tself; 
mbi->mv_field_sel[l][0] =bself; 

/* backward *1 

mbi->MV[0][l][0] = imintr - (i«l); 
mbi->MV[0][I][l] = (jmintr«I) - 0<<1); 
mbi->MV[l][l][0] = iminbr - (i«l); 
mbi->MV[l][l][l] = (jminbr«l) - (jcci); 
mbi->mv_field_sel[0][l] = tselr; 
rnbi->mv_field_sel[l][l] = bselr; 

} 

} 

} 

mbi->var = var; 


mbi++; 


} 

} 


static void frame_estimate(char *org, char *ref, 
char *mb, 
int i, int j, 
int sx, int sy, 
int *iminp, int *jminp, 
int *imintp, int *jmintp, 
int *iminbp, int *jminbp, 
float *dframep, float *dfieldp, 
int *tselp, int *bselp, 
int imins[2][2], int jmins[2] [2] ) 

{ 

float dt,db,dmint,dminb; 

int imint,iminbjmintjminb; 

/* frame prediction */ 

*dframep = fullsearch(org, ref, mb, width, ij,sx,sy, 16, width, height, 
iminpjminp); 

/* predict top field from top field */ 

dt = fullsearch(org,ref,mb,width«l,ij»l,sx,sy»l,8,width,height»l, 
&imint,&jmint); 

/* predict top field from bottom field */ 

db = fullsearch(org+width,ref+width,mb,width« 1 ,i j» 1 ,sx,sy» 1 ,8,width,height» 1 , 
&iminb,&jminb); 


imins[0][0] = imint; 
jmins[0][0] = jmint; 
imins[l][0] = iminb; 
jmins[l][0] = jminb; 

/* select prediction for top field */ 
if (dt<=db) 

{ 

dmint=dt; *imintp=imint; *jmintp=jmint; *tselp=0; 

} 

else 

{ 

dmint=db; *imintp=iminb; *jmintp=jminb; *tselp=l; 

} 

/* predict bottom field from top field */ 

dt = ftillsearch(org,ref,mb+width,width«l,ij»l,sx,sy»l,8,width,height»l, 
&imint,&jmint); 

/* predict bottom field from bottom field */ 
db = fullsearch(org+width, ref+width, mb+width, 

width«l, i, j»l, sx, sy»l, 8, width, height»l, 

&iminb,&jminb), 


imins[0][l] = imint; 
jmins[0][l] = jmint; 
imins[l][l] = iminb; 
jmins[l][l] = jminb; 

/* select prediction for bottom field */ 
if (db<=dt) 

{ 

dminb=db; *iminbp=iminb; *jminbp=jminb; *bselp=l; 

} 

else 

{ 

dminb=dt; *iminbp=imint; *jminbp=jmint; *bselp=0; 

} 

*dfieldp=dmint+dminb; 

} 


static float fullsearch(char *org, char *ref, 
char *blk, 
int lx, 
int iO, 
intjO, 
int sx, 
int sy, 
char h, 
int xmax, 
int ymax, 
int *iminp, 
int *jminp) 

{ 

int ij,iminjmin,ilow,ihighjlowjhigh; 
float d,dmm; 
int k,l,sxy; 
int ii,jj; 


ilow = iO - sx; 
ihigh = iO + sx; 


if (ilow<0) ilow = 0; 


if (ihigh>xmax-16) ihigh = xmax-16; 


jlow=jO-sy; 
jhigh =j0 + sy; 


if (jlow<0) jlow = 0; 
if (jhigh>ymax-h) jhigh = ymax-h; 

/* full pel search, spiraling outwards */ 


imin = iO; 
jmin = jO; 

dmin = dist 1 (org+imin+lx*jmin, blk, lx, 0,0,h, 65536.0); 

sxy = (sx>sy) ? sx : sy; 

for (1=1, l<=sxy; 1++) 

{ 

i = iO - 1; 
j = jO - 1; 

for (k=0, k<8*l; k++) 

{ 

if (i>=ilow && i<=ihigh && j>=jlow && j<=jhigh) 

{ 

d = dist 1 (org+i+lx*j,blk,lx,0,0,h,dmin); 

if (d<dmin) 

{ 

dmin = d; 
imin = i; 
jmin =j; 

} 

} 

if (k<2*l) i++; 
else if (k<4*l)j++; 
else if (k<6*l) i~; 
else j~; 

} 

} 

/* half pel */ 

dmin = 65536.0; 

imin = imin « 1; 

jmin = jmin « 1; 

llow = imin - (imin>0); 

ihigh = imin + (imin<((xmax-16)«l»; 

jlow = jmin - (jmin>0); 

jhigh = jmin + (jmin<((ymax-h)«l)), 

for (j=jlow; j<=jhigh; j++) 
for (i=ilow; i<=ihigh; i++) 

{ 

ii = i » 1 ; 

ii=j» i ; 

d = dist l(ref+ii+lx*jj,blk, lx, i&l j&l,h, dmin), 

if (d<dmin) 

{ 

dmin = d; 
imin = i; 
jmin = j; 

} 

} 


*iminp = imin; 
*jminp = jmin; 

return dmin; 

} 


static float distl(char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h, 
float distlim ) 

{ 

char *pl,*pla,*p2; 
char ij; 
int v; 
float s; 

s — 0; 
pi = blkl; 
p2=blk2; 


if (!hx && !hy) 
for (j=0; j<h, j-+-»-) 

{ 

if ((v = pi [0] - p2[0])<0) v = -v; s+= (float) v; 
if ((v = pl[l] - p2[l])<0) v = -v; s+= (float) v; 
if ((v = pi [2] - p2[2])<0) v = -v; s+= (float) v; 
if ((v = pl[3] - p2[3])<0) v = -v; s+= (float) v; 
if ((v = pi [4] - p2[4])<0) v = -v; s+= (float) v; 
if((v = pl[5] -p2[5])<0) v *» -v; s+= (float) v; 
if ((v = pi [6] - p2[6])<0) v = -v; s+= (float) v; 
if ((v = pi [7] - p2[7])<0) v = -v; s+= (float) v; 
if ((v = pi [8] - p2[8])<0) v = -v; s+= (float) v; 
if ((v = pi [9] - p2[9])<0) v = -v; s+= (float) v; 
if ((v = pi 110] - p2[10])<0) v = -v; s+= (float) v; 
if ((v = pl[l 1] - p2[ 1 1 ])<0) v = -v; s+= (float) v; 
if ((v = pi [12] - p2[12])<0) v = -v; s+= (float) v; 
if ((v = pl[13] - p2[13])<0) v = -v; s+= (float) v; 
if ((v = p 1 [ 1 4] - p2[14])<0) v = -v; s+= (float) v; 
if ((v = pl[15] - p2[15])<0) v = -v; s+= (float) v; 

if (s >= distlim) 
break; 

pl+= lx; 
p2+= lx; 

} 

else if (hx && !hy) 
for (j=0; j<h; j++) 

{ 


for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+l)/2-p2[i]); 
if (v>=0) s+= (float) v; 
else s-= (float) v; 

} 

pl+= lx; 
p2+= lx; 

} 

else if (!hx && hy) 

{ 

pla = pi + lx; 
for (j=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int ) ((pl[i]+pla[i]+l) / 2 - p2[i]); 
if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

pi = pla; 
pla+= Lx; 
p2+= lx; 

} 

} 

else /* if (hx && hy) */ 

{ 

pla = pi +lx; 
for (j=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+pla[i]+pla[i+l]+2) / 4 - p2(i]); 
if (v>=0) 
s+= (float) v; 
else 

s— (float) v; 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

> 


return s; 

} 


static float dist2(char *blkl, 
char *blk2, 
int lx, 
char hx. 


char hy, 
char h ) 

{ 

char *pl,*pla,*p2, 
char i,j; 
int v; 
float s; 

s = 0; 
pi = blkl; 
p2 = blk2; 
if (!hx && !hy) 
for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = pi [i] - P 2[i]; 
s+= (float) (v*v); 

} 

pl+= lx; 
p2+= lx; 

} 

else if (hx && !hy) 
for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+l) / 2 - p2[i]); 
s+= (float) (v*v); 

} 

pl+= lx; 
p2+= lx; 

} 

else if (!hx && hy) 

{ 

pla = pi + lx; 
for (j=0; j<h; j++) 

{ 

for (i=0; i<16, i++) 

{ 

v = (int) ((pl[i]+pla[i]+l) / 2 - p2[i]), 
s+= (float) (v*v); 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 

} 

else /* if (hx && hy) */ 

{ 

pla = pi + lx; 
for (j=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 



{ 

v = (int) ((pl[i]+pl[i+l]+pla[i]+pla[i+l]+2) / 4 - p2[i]), 
s+= (float) (v*v); 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 


return s; 

} 


static float bdistl(char *pf, char *pb, char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

char *pfa,*pfb,*pfc,*pba,*pbb,*pbc; 
char i, j; 
int v; 
float s; 

pfa = pf + hxf; 
plb = pf + lx*hyf; 
pfc = pfb + hxf; 

pba = pb + hxb; 
pbb = pb + lx*hyb; 
pbc = pbb + hxb; 

s = 0; 

for (j=0;j<h;j++) 

{ 

for (i=0; i<16, i++) 

{ 

v = (int) (((*pf++ + *pfa++ + *pfb++ + *pfc++ + 2) / 4 + 

(*pb++ + *pba++ + *pbb++ + *pbc++ + 2) / 4 + 1) / 2 

- *p2++); 

if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

p2+= lx-16; 
pf+= lx-16; 
pfa+= lx-16; 
pfb+= lx-16; 
pfc+= lx-16; 
pb+= lx-16; 
pba+= lx-16; 


pbb+= lx- 16; 
pbc+= lx- 16; 

} 

return s; 

} 


static float bdist2(char *pf, char *pb, char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

char *pfa,*pfb,*pfc,*pba,*pbb,*pbc; 
char i, j; 
intv; 
float s; 

pfa = pf + hxf; 
pfb = pf + lx*hyf; 
pfc = pfb + hxf; 

pba = pb + hxb; 
pbb = pb + lx*hyb; 
pbc = pbb + hxb; 

s = 0; 

for O=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) (((*pf++ + *pfa++ + *pfb++ + *pfc++ + 2) / 4 + 

(*pb++ + *pba++ + *pbb++ + *pbc++ + 2) / 4 + 1) / 2 
- *p2++); 
s+= (float) (v*v); 

} 

p2+= lx-16, 
pf+= lx-16; 
pfa+= lx-16; 
pfb+= lx-16; 
pfc+= lx-16; 
pb+= lx-16; 
pba+= lx-16; 
pbb+= lx-16; 
pbc+= lx-16; 

} 

return s; 

} 


static float variance(char *p, int lx ) 
{ 


char i, j; 
int v; 
float s, s2; 


s = s2 = 0; 


for (j=0;j<16,j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = *p++; 
s+= (float) v; 
s2+= (float) (v*v), 

} 

p+= lx- 16; 

} 

s=0.0625*s; 
return s2 - s*s; 

} 


/********************************/ 
/* */ 

/* m021.c residing at PE5 */ 

/* */ 

/* *******************************/ 


^include "c:\tc2v2\dos.h" 
#include "c:\tc2v2\float.h" 
#include "c:\tc2v2\math.h" 
^include "c:\tc2v2\ctype.h” 
#include "c:\tc2v2\stdio h" 
^include "c:\tc2v2\stdlib.h" 
^include "c:\tc2v2\stnng.h" 
^include "c\tc2v2\chan.h" 
^include "c:\tc2v2\thread.h" 
#include "c.\tc2v2\time.h" 
^include "c:\tc2v2\ermo.h" 


#define GLOBAL I* used by global.h *1 

^include "c:\nick\h\constl.h" 

^include "c:\nick\h\config h" 

#include "c:\nick\h\global.h" 

static void init _ANSI_ARGS_((void)); 

char *neworg[l], 

void main(argc,argv,envp, ins, in_ports, outs, out_ports) 
int argc, ins, outs; 
char *argv[],*envp[]; 

CHAN *in_ports[],*out_ports[]; 

{ 

/* read parameter file */ 

chan_in_word(&M,LinkOInput); 

chan_in_word(&nframes,LinkOInput); 

initO; 

putseqO; 


static void init() 

{ 

int i, size, a; 

chan_in_word(&mb_width,LinkOInput); 

chan_in_word(&mb_height,LinkOInput); 

chan_in_word(&mb_height2,LinkOInput); 

chan_in_word(&width,LinkOInput); 

chan_in_word(&height,LinkOInput); 

chan_in_word(&width2,LinkOInput); 

chan_in_word(&height2,LinkOInput); 


size = width*height ; 


newrefframe[0] = (char *)malloc(size); 
oldrefframe[0] = (char *)malloc(size); 
neworgframe[0] = (char *)malloc(size); 
oldorgframefO] = (char *)malloc(size); 
neworg[0] = (char *)malloc(size); 

mbinfo = (struct mbinfo *)malIoc(37*sizeof(struct mbinfo)); 

for (a=0; a<37; a++) 

{ 

mbinfo[a].mb_type = mbinfo[a].moUon_type = 0; 

mbinfo[a] . MV[0] [0] [0] = mbinfo[a].MV[0][0][l] = mbinfo[a].MV[0][l][0] = 0; 
mbinfo[a].MV[0][l][l] = mbinfo[a].MV[l][0][0] = mbinfoIa].MV[l][0][l] =0; 
mbinfo[a].MV[ 1 ] [ 1 ] [0] = mbinfo[a].MV[l][l][l] = 0; 
mbinfo[a].mv_field_sel[0][0] = mbinfo[a].mv_field_sel[0][l] = 0; 
mbinfo[a].mv_field_sel[l][0] = mbinfo[a].mv_field_sel[l][l] = 0; 

} 


} 


void putseqO 

{ 

/* this routine assumes (N % M) == 0 */ 
int ij, k, f, fO, n, np, nb; 
int a; 

int sxf, syf, sxb, syb; 
int size, loop; 

size = width*height; 


/* loop through all frames in encoding/decoding order */ 
do{ 

chan_in_word(&loop,LinkOInput); 

if(loop=nframes) break; 

chan_in_word(&pict_type,LinkOInput); 

chan_in_word(&sxf,LinkOInput); 

chan_in_word(&syf,LinkOInput), 

chan_in_word(&sxb,LinkOInput); 

chan_in_word(&syb,LinkOInput); 

if ( pict_type != I TYPE ) 

{ 

chan_in_message(size,neworg[0],LinkOInput); 

chan_in_message(size,oldorgframe[0],LinkOInput); 

chan_in_message(size,oldrefframe[0],LinkOInput); 

if ( pict type != P TYPE ) 

{ 


chan_in_message(size,neworgframe[0],Link0Input); 
chanin _message(size,nevvrefframe[0],LinkOInput); 

} 

} 

/* loop through 37 macroblocks of the picture */ 

motion_estimation(oldorgframelO] , neworgframeJO] , 
oldrefframe[0], newreflrame[0], 
neworg[0], 
sxf, syf, 
sxb, syb, 
mbinfo); 

for (a=0; a<37; a++) 

{ 

chan_out_word(mbinfo[a].mb_type,LinkOOutput); 
chan_out_word(mbinfo[a].motion_type,LinkOOutput); 
chan_out_word(mbinfo[a] .MV[0] [0] [0),LinkOOutput); 
chan_out_word(mbinfo[a].MV[0] [0] [ 1 ],LinkOOutput); 
chan_out_word(mbinfo[a].MV[0][l][0],Link0Output); 
chan_out_word(mbinfo[a].MV[0] [ 1 ] [ 1 ],LinkOOutput), 
chan_out_word(mbinfo[a].MV[ 1 ][0][0],Link0Output), 
chan_out_word(mbinfo[a] MV[ 1 ] [0] [ 1 ],LinkOOutput) , 
chan_out_word(mbinfo[a].MV[ 1 ] [ 1] [0],LinkOOutput); 
chanout _word(mbinfo[a].MV[ 1 ] [ 1 ] [ 1 ],LinkOOutput); 
chan_out_word(mbinfo[a].mv_field_sel[0][0],Link0Output) 
chan_out_word(mbinfo[a] . mv_field_sel [0] [ 1 ] ,LinkOOutput) 
chan_out_word(mbinfo[a] .mv_field_sel[ 1] [0],LinkOOutput) 
chan_out_word(mbinfo[a] .mv_field_sel[ 1] [ l],LinkOOutput) 
} 

}while(loop<nframes); 

chan_out_word(!oop,LinkOOutput); 


} 


static void ffame_estimate _ANSI_ARGS_((char *org, 
char *ref, 
char *mb, 
inti, intj, 
int sx, int sy, 
int *iminp, int *jminp, 
int *imintp, int *jmintp, 
int *iminbp, int *jminbp, 
float *dframep, float *dfieldp, 
int *tselp, int *bselp, 
int imins[2][2], int jmins[2][2])); 

static float fullsearch _ANSI_ARGS_((char *org, 
char *ref, 
char *blk, 


int lx, 
int iO, 
int jO, 
int sx, 
int sy, 
char h, 
int xmax, 
int ymax, 
int *iminp, 
int *jminp)); 

static float distl _ANSI_ARGS_((char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h, 

float distlim)); 

static float dist2 _ANSI_ARGS_((char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h)); 

static float bdistl _ANSI_ARGS_((char *pf, 
char *pb, 
char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
int hyb, 
char h)); 

static float bdist2 _ANSI_ARGS_((char *pf, 
char *pb, 
char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
int hyb, 
char h)); 

static float variance _ANSI_ARGS_((char *p, int lx)); 


void motion_estimation(char *oldorg, char *neworg, 
char *oldref, char *newref, 
char *cur, 
int sxf, 
int syf, 


int sxb, 
int syb, 

struct mbinfo *mbi) 

{ 

intij, p, q; 

int imin jnun,inunfjminf,iminrjminr; 
int imint jmint,iminbjminb; 
int imintf jmintf.iminbfjminbf; 
int inuntr jmintr,iminbr jminbr; 
float var,vO, 

float dmc,dmcf,dmcr,dmci, vmc,vmcf, vmcr,vmci ; 
float dmcfield,dmcfieldf,dmcfieldr,dmcfieldi; 
int tsel,bsel,tself,bself,tselr,bselr; 
char *mb; 

int imins[2](2]jmins[2][2]; 
int a, ii, ij; 

/* loop through 37 macroblocks of the picture */ 

for(a=182; a<219; a++) 

{ 

jj = a»4; 
ii = a - (ij«4); 
j=ij«4; 
i = ii«4; 


mb = cur + i + width*j; 


var = variance(mb, width); 

/* for I type pictures */ 

if (pict_type=I_TYPE) 
mbi->mb_type = MBINTRA; 

/* for P type pictures */ 

if (pict_type=P_TYPE) 

{ 

fTame_estimate(oldorg,oldref,mb,ij,sxf,syf, 

&imin,&jmin,&imint,&jmint,&iminb,&jminb, 

&dmc,&dmcfleld,&tsel,&bsel,iminsjmins); 

/* select between frame and field prediction */ 
if (dmc<=dmcfield) 

{ 

mbi->motion_type = MC_FRAME; 
vmc = dist2(oldref+(imin» 1 )+width*(jmin» 1 ),mb, 
width, imin& 1 jmin& 1,16); 

} 

else 

{ 

mbi->motion_type = MCFIELD, 
dmc = dmcfield; 



vmc = dist2(o!dref+(tsel?width 0)+(imint»l)+(width«l)*(jmint»l), 
mb,width« 1 ,imint& 1 jmint& 1 ,8); 

vmc+= dist2(oldref+(bsel?width:0)+(iminb» 1 )+(width« l)*(jminb» 1 ), 
mb+width,width« 1 ,iminb& 1 jminb& 1,8), 


if (vmc>var && vmc>=9*256) 
mbi->mb_type = MBINTRA; 
else 
{ 

vO = dist2(oldref+i+width*j,mb,width,0,0,16); 
if (4*vO>5*vmc && vO>=9*256) 

{ 

/* use MC */ 
var = vmc; 

mbi->mb_type = MBFORWARD; 
if (mbi->motion_type=MC_FRAME) 

{ 

mbi->MV[0][0][0] = imin - (i«l); 
mbi->MV[0][0][l] = jmin - (j«l); 

} 

else /* if (mbi->motion_type=MC_FIELD) */ 

{ 

/* these are FRAME vectors */ 
mbi->MV[0][0][0] = imint - (i«l); 
mbi->MV[0][0][l] = (jmint«l) - 
mbi->MV[l][0][0] = iminb - (i«l), 
mbi->MV[l][0][l] = Ominb«l) - (j«l); 
mbi->mv_field_sel[0][0] = tsel; 
mbi->mv_field_sel[l][0] = bsel; 

} 

} 

else 

{ 

/* No-MC */ 
var = vO, 

mbi->mb_type = 0; 
mbi->motion_type = MC_FRAME; 
mbi->MV[0] [0] [0] = 0; 
mbi->MV[0][0][l] = 0; 

} 

} 


/* for B type pictures */ 

if (pict_type==B_TYPE) 

{ 

/* forward prediction */ 
frame_estimate(oldorg,oldref,mb,ij,sxf,syf, 

&iminf,&jminf,&imintf,&jmintf,&iminbf,&jminbf, 

&dmcf,&dmcfieldf,&tself,&bself,iminsjmins); 


/* backward prediction */ 

ffame_estimate(neworg,newref,mb,ij,s.\b,syb, 

&iminr,&jminr,&iminlr,&jmintr,&iminbr,&jminbr, 

&dmcr,&dmcfieldr,&tselr,&bselr,iminsjmins), 

/* calculate interpolated distance */ 

/* frame */ 

dmci = bdistl(oldref+(iminf»l )+\vidth*(jmmf»l), 
newref+(iminr» 1 )+width*(jminr» 1), 
mb,width,iminf& 1 jminf& 1 ,iminr& 1 jminr& 1 , 1 6); 

/* top field 

dmcfieldi = bdist I(oldref+(imintf»l)+(tself?width:0)+(width«l)*(jminti>> 1), 
newref+(imintr» 1 )+(tselr?width:0)+(width« 1 )*(jmintr» 1 ), 
mb,width« 1 ,imintf& 1 Jmintf& 1 ,imintr& 1 jmintr& 1,8); 

/* bottom field 

dmcfieldi+= bdist 1 (oIdref+(iminbf» 1 )+(bself?width:0)+(width«l )*(jminbf» 1 ), 
newref+(iminbr» 1 )+(bselr?width:0)+(width« 1 )*(jminbr» 1), 
mb+width,width« 1 ,iminbf& 1 jminbf& 1 ,iminbr& 1 jminbrA 1,8); 


if (dmci<dmcfieldi && dmci<dmcf && dmci<dmcfieldf 
&& dmci<dmcr && dmci<dmcfieldr) 

{ 

/* frame, interpolated */ 

mbi->mb_type = MB_FORWARD|MB_BACKWARD; 
mbi->motion_type = MC_FRAME; 
vmc = bdist2(oldref+(iminf» 1 )+width*(jminf» 1 ), 
newref+(iminr» 1 )+width*(jminr»l ), 
mb,width,iminf& 1 jminf& 1 ,iminr& 1 jminr& 1,16); 

} 

else if (dmcfieldi<dmcf && dmcfieldi<dmcfieldf 
&& dmcfieldi<dmcr && dmcfieldi<dmcfieldr) 

{ 

/* field, interpolated */ 

mbi->mb_type = MB_FORWARD|MB_BACKWARD; 
mbi->motion_type = MCFIELD; 

vmc = bdist2(oldref+(imintf»l)+(tselP\vidth:0)+(\vidth«l)*(jmintf»l), 
newref+(imintr» 1 )+(tselr?width:0)+(width«l)*(jmintr»l), 
mb,width« 1 ,imintf& 1 jmintf&l ,imintr& 1 jmintr& 1,8); 
vmc+= bdist2(oldref+(iminbf» 1 )+(bself?width:0)+(width« 1 )*(jminbf» 1 ), 
newref+(iminbr» 1 )+(bselr?\vidth:0)+(width« l)*(jnunbr» 1 ), 
mb+width,width« 1 ,iminbf& 1 jminbf& l,iminbr& 1 jminbrA 1 ,8); 

} 

else if (dmcf<dmcfieldf && dmcf<dmcr && dmcf<dmcfieldr) 

{ 

/* frame, forward */ 
mbi->mb_type = MBFORWARD; 
mbi->motion_type = MC_FRAME; 
vmc = dist2(oldref+(iminf»l)+width*(jminf»l),mb, 
width, iminf&l jminf&l,16); 

} 

else if (dmcfieldf<dmcr && dmcfieldf<dmcfieldr) 


{ 

/* field, forward */ 
mbi->mb_type = MBFORWARD; 
mbi->motion_type = MCFIELD, 

vmc = dist2(oldref+(tself?width:0)+(immtf>> 1 )+(width« 1 )*(jmintf» 1 ), 
mb,width«l,imintf&l jmintf&l,8); 

vmc+= dist2(oldref+(bself’width:0)+(iminbf» 1 )+(width« 1 )*(jminbf» 1 ), 
mb+width,width« 1 ,iminbf& 1 jminbf& 1 ,8); 

} 

else if (dmcr<dmcfieldr) 

{ 

1* frame, backward */ 
mbi->mb_type = MBBACKWARD; 
mbi->motion_type = MC_FRAME; 
vmc = dist2(newref+(iminr»l)+width*(jminr»l),mb, 
width, inunr& 1 jminr& 1,16); 

} 

else 

{ 

I* field, backward */ 
mbi->mb_type = MBBACKWARD; 
jnbi->motion_type = MCFIELD; 

vmc = dist2(newref+(tselr?width:0)+(imintr» 1 )+(width« 1 )*(jmintr» 1 ), 
mb,width« 1 ,imintr& 1 jmintr& 1 ,8); 

vmc+= dist2(newref+(bselr?width:0)+(iminbr» 1 )+(\vidth«l)*(jminbr» 1 ), 
mb+width,width« 1 ,iminbr& 1 jminbr& 1 ,8); 


if (vmc>var && vmc>=9*256) 
mbi->mb_type = MBINTRA; 
else 
{ 

var - vmc; 

if (mbi->motion_type=MC_FRAME) 

{ 

I* forward */ 

mbi->MV[0][0][0] = iminf - (i«l); 
mbi->MV[0][0][l] = jminf - (j«l); 

/* backward */ 

mbi->MV[0][l][0] = iminr - (i«l); 
mbi->MV[0][l][l] = jminr - (j«l); 

} 

else 

{ 

/* these are FRAME vectors */ 

/* forward */ 

mbi->MV[0] [0] [0] = irnintf - (i«l); 
mbi->MV[0][0][l] = (jmintf«l) - (j«l); 
mbi->MV[l][0][0] = iminbf - (i«l); 
mbi->MV[l][0][l] = (jminbf«l) - (j«l); 
mbi->mv _field_sel[0][0] = tself; 
mbi->mv_field_sel[l][0] = bself; 

/* backward */ 


mbi->MV[0][l][0] = imintr - (i«l); 
mbi->MV[0][l][l] = (jmintr«l) - (j«l); 
mbi->MV[l][l][0] = iminbr - (i«J); 
mbi->MV[l][l][l] = (jminbr«l) - (j«l); 
mbi->mv_field_sel[0][l] = tselr; 
mbi->mv_field_sel[l](l] = bselr; 

} 

} 

} 


mbi->var = var; 


mbi++; 


} 

} 


static void firame_estimate(char *org. char *ref, 
char *mb, 
int i, int j, 
int sx, int sy, 
int *iminp, int *jminp, 
int *imintp. int *jmintp, 
int *iminbp, int *jminbp, 
float *dframep, float *dfieldp, 
int *tselp, int *bselp, 
int imins[2][2], int jmins[2][2] ) 

{ 

float dt,db,dmint,dminb; 
int imint,iminbjmintjminb, 

/* frame prediction *1 

♦dframep = fullsearch(org, ref.mb, width, ij,sx,sy, 16,width,height, 
iminpjminp); 

/* predia top field from top field */ 

dt = fullsearch(org,ref,mb ) width«l,ij»l,sx,sy» 1,8, width, height»l, 
&imint,&jmint); 

I* predict top field from bottom field */ 

db = fiillsearch(org+width,ref+width,mb,width«l ,i j» 1 ,sx,sy» 1 ,8,width,height» 1 , 
&iminb,&jminb); 

imins[0][0] = imint; 
jmins(0][0] = jmint; 
imins[l][0] = iminb; 
jmins[l][0] = jminb; 

/* select prediction for top field */ 
if (dt<=db) 

{ 

dmint=dt; *imintp=imint; *jmintp=jmint; *tselp=0; 

} 


else 

{ 

dmint=db; *imintp=iminb; *jmintp=jminb; *tselp=l; 

} 

/* predict bottom field from top field */ 

dt = fullsearch(org ? ref,mb+width, width<< 1 ,i j» 1 ,sx,sy» 1 ,8,width,height» 1 , 
&imint,&jmint); 

/* predict bottom field from bottom field */ 
db = fiillsearch(org+width, ref+width, mb+width, 

width«l, i, j»l, sx, sy»l, 8, width, height»l, 

&iminb,&jminb); 

imins[0][l] = imint; 
jmins[0][l] = jmint; 
imins[l][l] = iminb; 
jmins[l][l] = jminb; 

/* select prediction for bottom field */ 
if (db<=dt) 

{ 

dminb^db; *iminbp=iminb; *jminbp=jminb; *bselp=l; 

} 

else 

{ 

dminb=dt; *iminbp=imint; *jminbp==jmint; *bselp=0; 

} 

*dfieldp=dmint+dminb; 

} 


static float fiillsearch(char *org, char *ref, 
char *blk, 
int lx, 
int iO, 
int jO, 
int sx, 
int sy, 
char h, 
int xmax, 
int ymax, 
int *immp, 
int *jminp) 

{ 

int ij,iminjmin,ilow ? ihighjlowjhigh; 
float d,dmin; 
int k,l,sxy; 
int ii, jj; 


ilow = iO - sx; 
ihigh = iO + sx; 


if (ilow<0) ilow = 0; 
if (ihigh>xmax-16) ihigh = xmax-16; 


jlow = jO - sy; 
jhigh = jO + sy; 


if (jlow<0) jlow = 0; 

if (jhigh>ymax-h) jhigh = ymax-h, 

/* full pel search, spiraling outwards */ 

imin = iO; 
jmin=jO; 

dmin = distl(org+imin+lx*jmin,blk,lx,0,0,h,65536.0); 


sxy = (sx>sy) ? sx : sy; 

for (1=1; l<=sxy; 1++) 

{ 

i = iO - 1; 
j = jO - 1; 

for (k=0; k<8*l; k++) 

{ 

if (i>=ilow && i<=ihigh && j>=jlow && j<=jhigh) 

{ 

d = distl(org+i+lx*j,blk,lx,0,0,h,dmin); 

if (d<dmin) 

{ 

dmin = d; 
imin = i; 
jmin =j; 

} 

} 

if (k<2*l) i++; 
else if (k<4*l) j++; 
else if (k<6*l) i— ; 
else j--; 

} 

} 

/* half pel */ 

dmin = 65536.0; 

imin = imin« 1; 

jmin = jmin « 1; 

ilow = imin - (imin>0); 

ihigh = imin + (imin<((xmax-16)«l)); 

jlow = jmin - (jmin^), 

jhigh = jmin + (jmin<((ymax-h)«l)), 


for (j=jlow; j<=jhigh; j++) 


for (i=ilow; i<=ihigh, i++) 

{ 

ii - i » 1 ; 

ii=j» i ; 

d = distl(ref+ii+lx*jj,blk,lx,i&l j&l,h,dmin); 

if (d<dmin) 

{ 

dmin = d; 
imin = i; 
jmm = j; 

} 

} 

♦iminp = imin; 

♦jminp =jmin; 

return dmin, 

} 


static float distlfchar *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h, 
float distlim ) 

{ 

char *pl,*pla,*p2; 
char i, j; 
int v; 
float s; 

s = 0; 
pi =blkl; 
p2 = blk2; 

if (!hx && !hy) 
for (j=0;j<h;j++) 

{ 

if((v = pl[0] - p2[0))<0) v = -v; s+= (float) v; 
if ((v = p 1 [ 1 ] - p2[l])<0) v = -v; s+= (float) v; 
if((v = pl[2] -p2[2])<0) v = -v; s+= (float) v; 
if((v = pl[3] -p2[3])<0) v = -v; s+= (float) v; 
if ((v = pi [4] - p2[4])<0) v = -v; s+= (float) v; 
if ((v = pi [5] - p2[5])<0) v = -v; s+= (float) v; 
if((v = pl[6] -p2[6])<0) v = -v; s+= (float) v; 
if((v = pl[7] -p2[7])<0) v = -v; s+= (float) v; 
if((v = pl[8] -p2[8])<0) v = -v; s+= (float) v; 
if ((v = pi [9] - p2[9])<0) v = -v; s+= (float) v; 
if ((v = pi [10] - p2[10])<0) v = -v; s+= (float) v 
if ((v = pl[l 1] - p2[ll])<0) v = -v; s+= (float) v 
if ((v = pl[12] - p2[12])<0) v = -v; s+= (float) v 


if ((v = plf 13J - p2[13])<0) v = -v; s+= (float) v; 
if ((v = pi [ 14] * p2[14])<0) v = -v; s+= (float) v, 
if ((v = pi [15] - p2[15])<0) v = -v; s+= (float) v; 


if (s >= distlim) 
break; 


pl+= lx; 
p2+= lx; 

} 

else if (hx && !hy) 
for (j=0;j<h;j-H-) 

{ 

for (i=0, i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+l) / 2 - p2[i)); 
if (v>=0) s+= (float) v; 
else s-= (float) v; 

} 

pl+= lx; 
p2+= lx; 

} 

else if (!hx && hy) 

{ 

pla = pi + lx; 
for O=0;j<h;j-H-) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int ) ((pl[i]+pla[i]+l) / 2 - p2[i]); 
if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 

} 

else /* if (hx && hy) */ 

{ 

pla = pi + lx; 
for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+pla[i]+pla[i+l]+2) / 4 - p2[i]); 
if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

pi =pla; 


pla+= lx; 
p2+= lx; 

} 

} 

return s; 

} 


static float dist2(char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h ) 

{ 

char *pl,*pla,*p2; 
char i, j; 
int v; 
float s; 

s = 0; 
pi = blkl; 
p2 = blk2; 
if (!hx && !hy) 
for o=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = pl[i] - P 2[i]; 
s+= (float) (v*v); 

} 

pl+= lx; 
p2+= lx; 

} 

else if (hx && !hy) 
for o=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+l) / 2 - p2[i]); 
s+= (float) (v*v); 

} 

pl+= lx; 
p2+= lx; 

} 

else if (!hx && hy) 

{ 

pla = pi + lx; 
for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pla[i]+l) / 2 - p2[i]); 


s+= (float) (v*v); 

} 

pi =pla; 
pla+= lx, 
p2+= lx; 

} 

} 

else /* if (hx && hy) */ 

{ 

pla = pi + lx; 
for (j=0J<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+pla[i]+pla[i+l]+2) / 4 - p2[i]); 
s+= (float) (v*v), 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 

} 

return s; 

} 


static float bdistl(char *pf, char *pb, char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

char *pfa,*pfb,*pfc,*pba,*pbb,*pbc, 
char i,j; 
int v; 
float s; 


pfa = pf + hxf; 
pfb = pf + lx*hyf; 
pfc = pfb + hxf; 

pba = pb + hxb; 
pbb = pb + Ix*hyb; 
pbc = pbb + hxb; 

s = 0; 

for (j=0, j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) (((*pf++ + *pfa++ + *pfb++ + *pfc++ + 2) / 4 + 

(*pb++ + *pba++ + *pbb++ + *pbc++ + 2) / 4 + 1) / 2 
- *p2++); 


if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 

p2+= lx-16; 
pf+= lx-16; 
pfa+= lx-16; 
pfb+= lx-16; 
pfc+= lx-16; 
pb+= lx-16; 
pba+= lx-16; 
pbb+= lx-16; 
pbc+= lx-16; 

} 

return s; 

} 


static float bdist2(char *pf, char *pb, char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

char *pfa,*pfb,*pfc,*pba,*pbb,*pbc; 
char i, j; 
intv; 
float s; 

pfa = pf + hxf; 
pfb = pf + lx*hyf; 
pfc = pfb + hxf; 

pba = pb + hxb; 
pbb = pb + lx*hyb; 
pbc = pbb + hxb; 

s — 0; 

for O=0,j < h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) (((*pf++ + *pfa++ + *pfb++ + *pfc++ + 2) / 4 + 

(*pb++ + *pba++ + *pbb++ + *pbc++ + 2) / 4 + 1) / 2 
- *p2++); 

s+= (float) (v*v); 

} 

p2+= lx-16, 
pf+= lx-16; 
pfa+= lx-16; 
pfb+= lx-16; 


pfc+= lx- 16; 
pb+= lx- 16; 
pba+= lx- 16 
pbb+= lx- 16 
pbc+= lx- 16 

} 

return s; 

} 


static float variance(char *p, int lx ) 

{ 

char i, j; 
int v; 
float s, s2; 


s = s2 = 0; 


for (j=0;j<16;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = *p++; 
s+= (float) v; 
s2+= (float) (v*v); 

} 

p+= lx-16; 

} 

s=0.0625*s; 
return s2 - s*s; 

} 



/********************************/ 
/* */ 


/* m022.c residing at PE6 */ 

/* */ 
/********************************/ 


^include "c:\tc2v2\dos h" 
include "c:\tc2v2\float.h” 
^include "c:\tc2v2\math.h" 
include "c:\tc2v2\ctype h" 
#include "c \tc2v2\stdio.h" 
^include "c:\tc2v2\stdlib.h" 
#include "c:\tc2v2\string.h" 
^include "c:\tc2v2\chan.h" 
^include "c:\tc2v2\thread.h" 
#include "c:\tc2v2\time.h" 
#include "c:\tc2v2\ermo.h" 


#define GLOBAL /* used by global.h */ 

^include "c:\nick\h\constl.h" 

#include "c:\nick\h\config.h" 

^include "c:\nick\h\global.h” 

static void init _ANSI_ARGS_((void)); 

char *ne\vorg[l]; 

void main(argc,argv,envp, ins, in_ports, outs, out_ports) 
int argc,ins,outs; 
char *argv[],*envp[]; 

CHAN * i n_ports[ ] , *out_ports [] ; 

{ 

/* read parameter file */ 

chan_in_word(&M,LinkOInput); 

chan_in_word(&nframes,LinkOInput); 

init(); 

putseqO; 


} 

static void init() 

{ 

int i, size, a; 

chan_in_word(&mb_width,LinkOInput); 

chan_in_word(&mb_height,LinkOInput); 

chan_in_word(&mb_height2,LinkOInput); 

chan_in_word(&width,LinkOInput), 

chan_in_\vord(&height,LinkOInput); 

chan_in_word(&\vidth2,LinkOInput); 

chan_in_word(&height2,LinkOInput); 


size = width* height ; 


ne\vrefframe[0] = (char *)malloc(size); 
oldrefiframe[0] = (char *)malloc(size); 
neworgframe[0] = (char *)malloc(size); 
oldorgframe[0] = (char *)malloc(size); 
neworgfO] = (char *)malloc(size); 

mbinfo = (struct tnbinfo *)malloc(37*sizeof(struct mbinfo)); 

for (a=0; a<37; a++) 

{ 

mbi nfo [a] . mbtype = mbinfo [a] . motiontype = 0; 

mbinfofa] . MV[0] [0] [0] = mbinfo[aJ.MV[0][0][lJ = mbinfo[a].MV[0][l][0] = 0; 
mbinfo[a].MV[0][l][l] = mbinfo[a].MV[l][0][0] = mbinfo[a].MV[l][0][l] = 0; 
mbinfo[a].MV(l]ll][0] = mbinfola].MV[l][l][l] = 0; 
mbinfo[a].mv_field_sel[0][0] = mbinfo[a].mv_field_sel[0][l] = 0; 
mbinfo[a].mv_field_sel[l][0] = mbinfo[a].mv_field_sel[l][l] = 0, 

} 


void putseqO 

{ 

/* this routine assumes (N % M) = 0 */ 
int i, j, k, f, fD, n, np, nb; 
int a; 

int sxf, syf, sxb, syb; 
int size, loop; 

size = width*height; 


/* loop through all frames in encoding/decoding order */ 
do{ 

chan_in_word(&loop,Link01nput); 

if(loop=nframes) break; 

chan_in_word(&pict_type,LinkOInput); 

chan_in_word(&sxf,LinkOInput); 

chan_in_word(&syf,LinkOInput); 

chan_in_word(&sxb,LinkOInput); 

chan_in_word(&syb,LinkOInput); 

if ( pict type != I_TYPE ) 

{ 

chan_in_message(size,neworg[0],LinkOInput); 
chan_in_message(size,oldorgframe[0],LinkOInput); 
chan_in_message(size,oldrefffame[0],LinkOInput); 
if ( pict type != P TYPE ) 

{ 

chan_in_message(size,neworgframe[0],LinkOInput); 



chan_in_message(size,ne\vre£frame{0],LinkOInput), 

} 

} 

/* loop through 37 macroblocks of the picture */ 

motion_estimation(oldorgframe[0], neworgframe[0], 
oldrefiframe[0], newrefframe[0], 
neworg[0], 
sxf, syf, 
sxb, syb, 
mbinfo); 

for (a=0; a<37; a++) 

{ 

chan_out_\vord(mbinfola].mb_type, LinkOOutput); 
chan_out_word(mbinfo[a].motion_type, LinkOOutput); 
chan_out_word(mbinfo[a].MV[0][0][0], LinkOOutput); 
chan_out_word(mbinfo[a] .MV[0] [0] [ 1 ],LinkOOutput); 
chan_out_word(mbinfo[a] .MV[0] [ 1 ] [0],LinkOOutput); 
chan_out_word(mbinfo[a] MV[0] [ 1 ] [ 1 ],LinkOOutput); 
chan_out_word(mbinfo[a].MV[ 1 ] [0] [0],LinkOOutput); 
chan_out_word(mbinfo[a] ,MV[ 1 ] [0] [ 1 ] .LinkOOutput); 
chanout _word(mbinfo[a] .MV[ 1 ] [ 1 ] [0] .LinkOOutput); 
chan_out_word(mbinfo[a] ,MV[ 1 ] [ 1 ] [ 1 ] , LinkOOutput); 
chan_out_word(mbinfo[a] . mv_field_sel [0] [0] .LinkOOutput); 
chan_out_word(mbinfo[a].mv_field_sel[0][l], LinkOOutput); 
chan_out_word(mbinfo[a].mv_field_sel[l][0], LinkOOutput); 
chan_out_word(mbinfoIa].mv_field_sel[l][l], LinkOOutput); 
} 

}while(loop<nframes); 
chan_out_word(loop, LinkOOutput); 


static void frame estimate _ANSI_ARGS_((char *org, 
char *ref, 
char *mb, 
int i, int j, 
int sx, int sy, 
int *iminp, int *jminp, 
int *imintp, int *jmintp, 
int *iminbp, int *jminbp, 
float *dframep, float *dfieldp, 
int *tselp, int *bselp, 
int imins[2][2], int jmins[2][2])); 

static float fullsearch _ANSI_ARGS_((char *org, 
char *ref, 
char *blk, 
int lx, 
int iO, 


intjO, 
int sx, 
int sy, 
char h, 
int xmax, 
intymax, 
int *iminp, 
int *jminp)); 

static float distl _ANSI_ARGS_((char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h, 

float distlim)); 

static float dist2 _ANSI_ARGS_((char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h»; 

static float bdistl _ANSI_ARGS_((char *pf, 
char *pb, 
char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
int hyb, 
char h)); 

static float bdist2 _ANSI_ARGS_((char *pf, 
char *pb, 
char *p2, 
int lx, 
int hxf, 
int hyf, 
int hxb, 
int hyb, 
char h)); 

static float variance _ANSI_ARGS_((char *p, int lx)); 


void motion_estimation(char *oldorg, char *neworg, 
char *oldref, char *newref, 
char *cur, 
int sxf, 
int syf, 
int sxb, 
int syb, 



struct mbinfo *mbi) 


{ 

inti.j, p, q; 

int iminjmm,iminfjminf,iminrjminr; 
int imint jmint,iminbjminb; 
int imintfjmintf,immbfjminbf; 
int imintrjmintr.iminbrjminbr; 
float var.vO; 

float dmc,dmcf,dmcr,dmci,vmc,vmcf,vmcr,vmci; 
float dmcfield,dmcfieldf,dmcfieldr,dmcfieldi; 
int tsel,bsel,tself,bself,tselr,bselr; 
char *mb; 

int imins[2][2] jmins[2][2]; 
int a, ii, jj; 

/* loop through 37 macroblocks of the picture */ 

for(a=219; a<256; a++) 

{ 

ij = a»4; 
ii = a - (jj«4); 
j =jj«4; 
i = ii«4; 


mb = cur + i + width*j, 


var = variance(mb, width); 

/* for I type pictures */ 

if (pict_type=I_TYPE) 
mbi->mb_type = MBINTRA; 

/* for P_type pictures */ 

if (pict_type=P_TYPE) 

{ 

frame_estimate(oldorg,oldref,mb,ij,sxf,syf, 

&imin,&jmin,&imint,&jmint,&iminb,&jminb, 

&dmc,&dmcfield,&tsel,&bsel,iminsjmins); 

/* select between frame and field prediction *1 
if (dmc<=dmcfield) 

{ 

mbi->motion_type = MC_FRAME; 
vmc = dist2(oldref+(imin»l)+width*(jmin»l),mb, 
width, imin& 1 jmin& 1,16), 

} 

else 

{ 

mbi->motion_type = MCFIELD, 
dmc = dmcfield; 

vmc = dist2(oldref+(tsel?width:0)+(inunt»l)+(width«l)*(jmint»l), 
mb,width« 1 ,imint& 1 jmint& 1 ,8); 


vmc+= dist2(oldref+(bsel?width.0)+(iminb»l)+(width«l)*(jminb»l), 
mb+width,\vidth« 1 ,imi nb& 1 jminb& 1,8); 

} 

if (vmc>var && vmc>=9*256) 
mbi->mb_type = MB_INTRA; 
else 
{ 

vO = dist2(oldref+i+width*j, mb, width, 0,0, 16); 
if (4*vO>5*vmc && vO>=9*256) 

{ 

/♦ use MC */ 
var = vmc; 

mbi->mb_type = MB_FORWARD; 
if (mbi->motion_type=MC_FRAME) 

{ 

mbi->MV[0][0][0] = imin - (i«l); 
mbi->MV[0][0][l] = jmin - (j <<: 1); 

} 

else t* if (mbi->motion_type==MC_FIELD) */ 

{ 

/* these are FRAME vectors */ 
mbi->MV[0][0][0] = imint - (i«l); 
mbi->MV[0][0][l] = (jmint«l) - (j«l); 
mbi->MV[l][0][0] = iminb - (i«l); 
mbi->MV[l][0][l] = (jminb«l) - G^l); 
mbi->mv_field_sel[0][0] = tsel; 
mbi->mv_field_sel[l][0] = bsel; 

} 

} 

else 

{ 

/* No-MC */ 
var = vO; 

mbi->mb_type = 0; 
mbi->motion_type = MC_FRAME; 
mbi->MV[0] [0] [0] = 0; 
mbi->MV[0][0][l] =0; 

} 

} 

} 


/* for B type pictures */ 

if (piettype— BTYPE) 

{ 

/* forward prediction */ 
frame_estimate(oldorg,oldref,mb,ij,sxf,syf, 

&iminf,&jminf,&imintf,&jmintf,&iminbf,&jminbf, 

&dmcf,&dmcfieldf,&tself,&bself,iminsjmins); 


/* backward prediction */ 
frame_estimate(neworg,newref,mb,ij,sxb,syb, 



&iminr,&jminr,&imintr,&jmintr,&iminbr,&jminbr, 

&dmcr,&dmcfieldr,&tselr,&bselr,iminsjmins); 

/* calculate interpolated distance */ 

/* frame */ 

dmci = bdistl(oldref+(iminfi > >l)+width*(jminf^>l), 
newref+(iminr» 1 )+width*(jminr» 1), 
mb, width, iminf& 1 jminf& 1 ,iminr&l jminr& 1,16), 

I* top field 

dmcfieldi = bdist 1 (oldref+(imintf»l)+(tself?width:0)+(width«l )*(jmintf» 1 ), 
newref+(imintr» 1 )+(tselr?width:0)+(width« 1 )*(jmintr» 1), 
mb,width« 1 ,imintf& 1 jmintf& 1 ,muntr& 1 jmintr& 1,8); 

/* bottom field 

dmcfieldi+= bdistl(oldref+(iminbf»l)+(bself?width:0)+(width«l)*(jminb£»l), 
newref+(iminbr» 1 )+(bselr?width:0)+(width« 1 )*(jminbr» 1 ), 
mb+width,width« 1 ,iminbf& 1 jmtnbf& 1 ,iminbr& 1 jminbr& 1,8), 

/* select prediction type of minimum distance from the 
* six candidates (field/frame * forward/backward/interpolated) 

*/ 

if (dmci<dmcfieldi && dmci<dmcf && dmci<dmcfieldf 
&& dmci<dmcr && dmci<dmcfieldr) 

{ 

I* frame, interpolated *1 

mbi->mb_type = MB_FORWARD|MB_BACKWARD; 
mbi->motion_type = MC_FRAME; 
vmc = bdist2(oldref+(iminf» 1 )+\vidth*(jminf» 1 ), 
ne\vref+(iminr» 1 )+width*(jminr» 1), 
mb, width,iminf& 1 jminf& 1 ,iminr& 1 jminr& 1 , 16); 

} 

else if (dmcfieldi<dmcf && dmcfieldi<dmcfieldf 
&& dmcfieldi<dmcr && dmcfieldi<dmcfieldr) 

{ 

/* field, interpolated */ 

mbi->mb_type = MBFORW ARD|MB_B ACKW ARD, 
mbi->motion_type = MC_FEELD; 

vmc = bdist2(oldref+(imintf»l)+(tself?width:0)+(width«l)*(jinintf»l), 
newref+(imintr» 1 )+(tselr?width:0)+(width« 1 )*(jmintr» 1 ), 
mb,width« 1 ,imintf& 1 jmintf& 1 ,imintr& 1 Jmintr& 1 ,8); 
vmc+= bdist2(oldref+(iminbf»l)+(bself?width:0)+(\vidth«l)*(jminbf»l), 
newref+(iminbr» 1 )+(bselr?width:0)+(width« 1 )*(jminbr» 1 ), 
mb+ width, width« 1 ,iminbf&l jminbf& 1 ,iminbr& 1 jminbr& 1,8); 

} 

else if (dmcf<dmcfieldf && dmcf<dmcr && dmcf<dmcfieldr) 

{ 

f* frame, forward *1 
mbi->mb_type = MBFORWARD; 
mbi->motion_type = MCFRAME; 
vmc = dist2(oldref+(iminl>> 1 )+width*(jminf >> 1 ),mb, 
width, iminf& 1 jminf& 1 , 1 6); 


> 


else if (dmcfieldf<dmcr && dmcfieldf<dmcfieldr) 

{ 

/* field, forward */ 
mbi->mb_type = MB_FORWARD; 
mbi->motion_type = MC_FIELD; 

vmc = dist2(oldref+(tself?width:0)+(imintf» 1 )+(width« 1 )*(jmintf» 1), 
mb,width« 1 ,imintf& 1 jmintf& 1 ,8); 

vmc+= dist2(oldref+(bself?width:0)+(iminbf» 1 )+(width« l)*(jminbf» 1 ), 
mb+width, width« 1 ,iminbf& 1 jminbf& 1,8); 

} 

else if (dmcr<dmcfieldr) 

{ 

/* frame, backward */ 
mbi->mb_type = MB_B ACK W ARD ; 
mbi->motion_type = MC_FRAME; 
vmc = dist2(newref+(iminr»l)+width*(jminr»l),mb, 
width, iminr&l jminr&l,16); 

} 

else 

{ 

/* field, backward */ 
mbi->mb_type = MBBACKWARD; 
mbi->motion_type = MC_FIELD; 

vmc = dist2(newref+(tselr?width:0)+(imintr»l)+(width«l)*(jminu»l), 
mb, width« 1 ,imintr& 1 jmintr& 1,8); 

vmc+= dist2(newref+(bselr?width:0)+(iminbr» l)+(width« 1 )*(jminbr» 1 ), 
mb+width, width« 1 ,iminbr& 1 jminbr& 1,8); 


if (vmc>var && vmc>=9*256) 
mbi->mb_type = MBINTRA; 
else 
{ 

var = vmc; 

if (mbi->motion_type=MC_FRAME) 

{ 

/ * forward */ 

mbi->MV[0] [0] [0] = iminf - (i«l), 
mbi->MV[0][0][l] = jminf - (j <<: l); 

/* backward */ 

mbi->MV[0][l][0] = iminr - (i«l); 
mbi->MV[0][l][l] = jminr - (j«l), 

} 

else 

{ 

/* these are FRAME vectors */ 

/* forward */ 

mbi->MV[0][0][0] = imintf - (i«l); 
mbi->MV[0][0][l] = (jmintf«l) - (j«l); 
mbi->MV[l][0][0] = iminbf - (i«l); 
mbi->MV[l][0)[l] = (jminbf«l) - (j«l); 
mbi->mv_field_sel[0][0] = tself; 
mbi->mv_field_sel[l][0] = bself; 



/* backward *1 

mbi->MV[0][l][0] = imintr - (i«l); 
mbi->MV[0][l][l] = (jmintr«l) - 0«1); 
mbi->MV[l][l][0] = iminbr - (i«l); 
mbi->MV[l][l][l] = (jminbr«l) - (j«l); 
mbi->mv_field_sel[0][l] = tselr; 
mbi->mv_field_sel[l][l] =bselr, 

} 

} 

} 

mbi->var = var, 
mbi++; 


} 

} 


static void frame_estimate(char *org, char *ref, 
char *mb, 
int i, int j, 
int sx, int sy, 
int *iminp, int *jminp, 
int *imintp, int *jmintp, 
int *iminbp, int *jminbp, 
float *dframep, float *dfieldp, 
int *tselp, int *bselp, 
int imins[2][2], int jmins[2][2] ) 

{ 

float dt,db,dmint > dminb; 
int imint,iminbjmintjminb; 

/* frame prediction */ 

♦dframep = ftillsearch(org,ref,mb,width,ij,sx,sy,16,width,height, 
iminpjminp); 

/* predict top field from top field */ 

dt = fullsearch(org > ref,mb ) width«l,ij»l,sx,sy»l,8 ) width,height»l, 
&imint,&jmint); 

/* predict top field from bottom field */ 

db = fullsearch(org+width,ref+width,mb,width« 1 ,i j» 1 ,sx,sy» 1 ,8,width,height» 1 , 
&iminb,&jminb); 

imins[0][0] = imint; 
jmins[0][0] = jmint; 
imins[l][0] = iminb; 
jmins[l][0] = jminb; 

/* select prediction for top field */ 
if (dt<=db) 

{ 

dmint=dt; *imintp=imint; *jmintp=jmint; *tselp=0; 


} 

else 

{ 

dmint=db; *imintp=iminb; *jmintp=jminb; *tselp=l; 

} 

/* predict bottom field from top field *1 

dt = fullsearch(org,ref,mb+width,width«l,ij»l,sx,sy»l,8,width,height»l, 
&imint,&jmint); 

/* predict bottom field from bottom field */ 
db = fullsearch(org+width, ref+width, mb+width, 

width«l, i, j»l, sx, sy»l, 8, width, height»l, 

&iminb,&jminb); 

imins[0][l] = imint; 
jmins[0][l] = jmint; 
imins[l][l] = iminb; 
jmins[l][l] =jminb; 

I* select prediction for bottom field */ 
if (db<=dt) 

{ 

dminb=db, ’ t iminbp=iminb; *jminbp=jminb; *bselp=l; 

} 

else 

{ 

dminb=dt; *iminbp=imint; *jminbp=jmint; *bselp=0; 

} 

*dfieldp=dmint+dminb; 

} 


static float fullsearch(char *org, char *ref, 
char *blk, 
int lx, 
int iO, 
int jO, 
int sx, 
intsy, 
char h, 
int xmax, 
int ymax, 
int *iminp, 
int *jminp) 

{ 

int ij,iminjmin,ilow,ihighjlowjhigh; 
float d,dmin; 
int k,l,sxy; 
int ii, jj; 


ilow = iO - sx; 
ihigh = iO + sx; 


if (ilow<0) ilow = 0, 


if (ihigh>xmax-16) ihigh = xmax-16; 

jlow = jO - sy; 
jhigh = jO + sy; 

if (jlow<0) jlow = 0; 

if (jhigh>ymax-h) jhigh = ymax-h; 

/* full pel search, spiraling outwards */ 

imin = iO; 
jmin=jO; 

dmin = distl(org+imin+lx*jmin,blk, lx, 0,0, h, 65536.0); 

sxy = (sx>sy) ? sx : sy; 

for (1=1; l<=sxy; 1++) 

{ 

i = iO - 1; 
j = jO - 1; 

for (k=0; k<8*l; k++) 

{ 

if (i>=ilow && i<=ihigh && j>=jlow && j<=jhigh) 

{ 

d = distl(org+i+lx*j,blk,lx,0,0,h,dmin); 

if (d<dmin) 

{ 

dmin = d; 
imin = i; 
jmin =j; 

} 

} 

if (k<2*l) i++; 
else if (k <: 4*l) j++; 
else if (k<6*l) i~; 
else j~; 

} 

} 

/* half pel */ 

dmin = 65536.0; 

imin = imin « 1; 

jmin = jmin « 1; 

ilow = imin - (imin>0); 

ihigh = imin + (imin<((xmax-16)«l)); 

jlow = jmin - (jmin>0); 

jhigh = jmin + (jmin<((ymax-h)«l)); 


for (j=jlow; j<=jhigh; j++) 
for (i=ilow; i<=ihigh; i++) 

{ 

ii = i » 1 ; 

jj = j >> 1 . 

d = distl(ref+ii+lx*jj,blk,lx,i&l j&l,h,dmin); 

if (d<dmin) 

{ 

dmin = d; 
imin = i; 
jmin =j; 

} 

} 

*iminp = imin; 

♦jminp = jmin; 

return dmin; 

} 


static float distl(char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h, 
float distlim ) 

{ 

char *pl,*pla,*p2; 
char i, j; 
int v; 
float s; 

s = 0, 
pi = blkl; 
p2 = blk2; 

if (!hx && !hy) 
for (j=0;j<h;j++) 

{ 

if ((v = pi [0] * p2[0))<0) v = -v; s+= (float) v, 

if ((v = pi [1] - p2[l ])<0) v = -v; s+= (float) v; 

if ((v = pi [2] - p2[2])<0) v = -v; s+= (float) v; 

if((v = pl[3] -p2[3])<0) v = -v; s+= (float) v; 

if ((v = pi [4] - p2[4])<0) v = -v; s+= (float) v; 

if ((v = pi [5] - p2[5])<0) v = -v; s+= (float) v; 

if((v = pl[6] -p2[6])<0) v = -v; s+= (float) v; 
if ((v = pi [7] - p2[7])<0) v = -v; s+= (float) v; 

if ((v = pi [8] - p2[8])<0) v = -v; s+= (float) v; 

if((v = pl[9] - p2[9])<0) v = -v; s+= (float) v; 
if ((v = pi [10] - p2[10])<0) v = -v; s+= (float) v 
if ((v = pi [1 1] - p2[l 1])<0) v = -v, s+= (float) v 


if ((v = pi [ 12] - p2[12])<0) v = -v; s+= (float) v, 
if ((v = pi [13] - p2[13])<0) v = -v, s+= (float) v, 
if ((v = pi [14] - p2[14])<0) v = -v; s+= (float) v; 
if ((v = p 1 [ 1 5] - p2 [ 1 5])<0) v = -v, s+= (float) v; 

if (s >= distlim) 
break; 

pl+= lx; 
p2+= lx; 

} 

else if (hx && !hy) 
for 0=0; j<h, j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+l) / 2 - p2[i]); 
if (v>=0) s+= (float) v; 
else s-= (float) v; 

} 

pl+= lx; 
p2+= lx; 

} 

else if (!hx && hy) 

{ 

pla = pi + lx; 
for 0=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int ) «pl[i]+pla[i]+l) / 2 - p2[i]); 
if (v>=0) 
s+= (float) v; 
else 

s— (float) v; 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 

} 

else /* if (hx && hy) */ 

{ 

pla = pi +lx; 
for O=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+pla[i]+pla[i+l]+2)/4 - p2[i]); 
if (v>=0) 
s+= (float) v; 
else 

s-= (float) v; 

} 


pi = pla; 
pla+= lx; 
p2+= lx; 

} 


return s, 

} 


static float dist2(char *blkl, 
char *blk2, 
int lx, 
char hx, 
char hy, 
char h ) 

{ 

char *pl,*pla,*p2; 
char i, j; 
int v; 
float s; 

s = 0; 
pi = blkl; 
p2 = blk2; 
if (!hx && !hy) 
for (j=0; j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = pl[i] - P 2[i]; 
s+= (float) (v*v); 

} 

pl+= lx; 
p2+= lx; 

} 

else if (hx && !hy) 
for O=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+l)/2-p2[i]); 
s+= (float) (v*v); 

} 

pl+= lx; 
p2+= lx; 

} 

else if (!hx&& hy) 

{ 

pla = pi + lx; 
for 0=0; j<h; j++) 

for (i=0; i<16; i++) . 

< 


v = (int) ((pl[i]+pla[i]+l) / 2 - P 2[i]); 
s+= (float) (v*v); 

} 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 

} 

else /* if (hx && hy) */ 

{ 

pla = pi + lx; 
for (j=0,j < h;j ++ ) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) ((pl[i]+pl[i+l]+pla[i]+pla[i+l]+2) / 4 - p2[i]); 
s+= (float) (v*v); 

> 

pi = pla; 
pla+= lx; 
p2+= lx; 

} 


return s; 

} 


static float bdistl(char *pf, char *pb, char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

char *pfa,*pfb,*pfc,*pba,*pbb,*pbc; 
char i, j; 
int v; 
float s; 

pfa = pf + hxf; 
pfb = pf + lx*hyf; 
pfc = pfb + hxf; 


pba = pb + hxb; 
pbb = pb + lx*hyb; 
pbc = pbb + hxb; 

s = 0; 

for (j=0; j<h; j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) (((*pf++ + *pfa++ + *pfb++ + *pfc++ + 2) / 4 + 

(*pb++ + *pba++ + *pbb++ + *pbc++ + 2) / 4 + 1) / 2 


- *p2++); 


if (v>=0) 
s+= (float) v, 
else 

s-= (float) v; 

} 

p2+= lx-16; 
pf+= lx-16; 
pfa+= lx-16; 
plb+= lx-16; 
pfc+= lx-16; 
pb+= lx-16; 
pba+= lx-16, 
pbb+= lx-16; 
pbc+= lx-16; 

} 

return s; 

} 


static float bdist2(char *pf, char *pb, char *p2, 
int lx, int hxf, 
int hyf, int hxb, 
int hyb, char h ) 

{ 

char *pfa,*pfb,*pfc,*pba,*pbb,*pbc; 
char i, j; 
int v; 
float s; 


pfa = pf + hxf; 
pfb = pf + lx*hyf; 
pfc = pfb + hxf; 


pba = pb + hxb; 
pbb = pb + lx* hyb, 
pbc = pbb + hxb; 

s = 0; 

for (j=0;j<h;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = (int) (((*pf++ + *pfa++ + *pfb++ + *pfc++ + 2) / 4 + 

(*pb++ + *pba++ + *pbb++ + *pbc++ + 2) / 4 + 1) / 2 
- *p2++); 
s+= (float) (v*v); 

} 

p2+= lx-16; 
pf+= lx-16; 
pfa+= lx-16, 


pfb+= lx- 16; 
pfc+= lx- 16, 
pb+= lx- 16; 
pba+= lx- 16; 
pbb+= lx-16; 
pbc+= lx-16; 

} 

return s; 

} 


static float variance(char *p, int lx ) 

{ 

char ij; 
intv; 

float s, s2; 

s = s2 = 0; 

for0=0;j<16;j++) 

{ 

for (i=0; i<16; i++) 

{ 

v = *p++; 
s+= (float) v; 
s2+= (float) (v*v); 

> 

p+= lx-16; 

} 

s=0.0625*s; 
return s2 - s*s, 

} 


/********************************/ 


/* 


*/ 

/* 

tO.c residing at Root 

*/ 

/* 


*/ 


/********************************/ 

#include "c:\tc2v2\dos.h” 

#include "c:\tc2v2\float.h" 

#include "c\tc2v2\math.h" 
include "c:\tc2v2\ctype.h" 

^include "c:\tc2v2\stdio.h" 

#include "c:\tc2v2\stdlib.h" 

^include "c.\tc2v2\string.h" 

#include "c:\tc2v2\chan.h” 

^include "c:\tc2v2\thread.h" 

#include "c:\tc2v2\time.h" 

#include "c:\tc2v2\ermo.h" 

#define GLOBAL /* used by global.h */ 

#include "c:\nick\h\constl.h" 

#include "c:\nick\h\config.h" 
include "c:\nick\h\globaI.h" 

static void init _ANSI_ARGS_((void)); 

static void readparmfile _ANSI_ARGS_((char *fname)); 

void main(argc,argv,envp,ins,in_ports,outs,out_ports) 
int argc, ins, outs; 
char *argv[],*envp[], 

CHAN *in_ports[],*out_ports[J; 

{ 

if (argc!=2) 

{ 

printf("Usage: mpeg2encode in.par\n"); 
exit(O); 

} 

/* read parameter file */ 
readparmiile(argv[ 1 ]); 

init(); 

putseqO; 


void error(char *text) 

{ 

fprintf(stderr,text), 

putcC\n',stderr); 

exit(l); 

} 


static void readparmfile(char *fname) 


{ 

int i; 

int h,m,s,f; 

FILE *fd; 
char line[256], 

int r,Xi,Xb,Xp,dOi,dOp,dOb; /* rate control */ 
double avgact; /* rate control */ 

sprintf(line,"c:\\nick\\par\\%s",fname); 
if (!(fd = fopen(line,"r"))) 

{ 

sprintf(errortext,"Couldn't open parameter file %s",line); 
error(errortext); 

} 

fgets(id_string,254,fd); 
fgets(line,254,fd); sscanf(line,"%s",tplorg); 
fgets(line,254,fd); sscanf(line,"%s",tplref); 
fgets(line,254,fd); sscanf(line,"%s",iqname); 
fgets(line,254,fd); sscanf(line,"%s”,niqname); 
fgets(line,254,fd), sscanf(line,"%s",statname); 
fgets(line,254,fd), sscanf(line,"%d",&inputtype); 
fgets(line,254,fd); sscanf(line,”%d",&nframes); 
fgets(line,254,fd); sscanf(line,''%d",&frameO); 
fgets(line,254,fd); sscanf(line > "%d:%d:%d:%d",&h,&m,&s,&0; 
fgets(line,254,fd); sscanf(line,"%d",&N); 
fgets(line,254,fd); sscanf(line,”%d",&M); 
fgets(line,254,fd); sscanf(line,"%d",&mpegl); 
fgets(line,254,fd); sscanf(line,"%d",&fieldpic), 
fgets(line,254,fd); sscanf(line,"%d",&horizontal_size); 
fgets(line,254,fd); sscanf(line,"%d",&vertical_size); 
fgets(lme,254,fd); sscanf(line,"%d",&aspectratio); 
fgets(line,254,fd); sscanf(line,"%d",&framejrate_code); 
fgets(line,254,fd); sscanf(line,"%lf\&bit_rate); 
fgets(line,254,fd); sscanf(line,"%d",&vbv_bufrer_size); 
fgets(line,254,fd); sscanf(line,"%d",&low_delay); 
fgets(hne,254,fd); sscanf(line,"%d",&constrparms); 
fgets(line,254,fd); sscanf(line,”%d" > &profile); 
fgets(line,254,fd); sscanf(line,"%d",&level); 
fgets(line,254,fd); sscanf(line, ''%d",&prog_seq); 
fgets(line,254,fd); sscanf(line,"%d",&chroma_format); 
fgets(line,254,fd); sscanf(line,"%d",&video_format); 
fgets(line,254,fd); sscanf(line,’ , %d",&color_primaries); 
fgets(line,254,fd); sscanf(line,"%d" > &transfer_characteristics), 
fgets(line,254,fd); sscanf(line,"%d",&matrix_coefficients); 
fgets(line,254,fd); sscanf(line,''%d",&display_horizontal_size); 
fgets(line,254,fd); sscanf(line,"%d",&display_vertical_size); 
fgets(line,254,fd); sscanf(line,"%d",&dc_prec); 
fgets(line,254,fd); sscanf(line,”%d",&topfirst); 
fgets(line,254,fd); sscanf(line,"%d %d %d", 
framej5red_dct_tab,frame_pred_dct_tab+ 1 ,frame_pred_dct_tab+2); 

fgets(line,254,fd); sscanf(line,"%d %d %d", 
conceal_tab,conceal_tab+ 1 ,conceal_tab+2); 


Q.-H. 


fgets(line,254,fd); sscanf(hne,"%d %d %d”, 
qscale_tab,qscale_tab+ 1 ,qscale_tab+2); 

fgets(line,254,fd); sscanf(line,"%d %d %d", 
intravlc_tab,intravlc_tab+l,intravlc_tab+2); 
fgets(line,254,fd); sscanf(line,"%d %d %d", 
altscan_tab,altscan_tab+ 1 ,altscan_tab+2); 
fgets(line,254,fd); sscanf(line,"%d",&repeatfirst); 
fgets(line,254,fd), sscanf(line,"%d",&prog_frame); 

/* intra slice interval refresh period */ 
fgets(line,254,fd); sscanf(line,”%d",&P); 
fgets(line,254,fd); sscanf(line,"%d",&r); 
fgets(line,254,fd); sscanf(line, "%lf \&avg_act); 
fgets(line,254,fd); sscanf(line,''%d",&Xi); 
fgets(line,254,fd); sscanf(line,"%d”,&Xp); 
fgets(line,254,fd), sscanf(line,"%d",&Xb), 
fgets(line,254,fd); sscanf(line,"%d",&dOi); 
fgets(line,254,fd); sscanf(line,"%d",&dOp); 

. fgets(line,254,fd); sscanf(line,"%d",&dOb); 

if (N<1) 

error("N must be positive"); 
if (M<1) 

error("M must be positive"); 
if (N%M != 0) 

error(”N must be an integer multiple of M"); 

motiondata = (struct motiondata *)malloc(M*sizeof(struct motiondata)); 
if (Imotiondata) error("malloc failed\n"); 


for (i=0, i<M; i++) 

{ 

fgets(line,254,fd); 
sscanf(line,"%d %d %d %d", 

&motion_data [i ] . forw hor f code, &motion_data[i] .forw vert f code, 
&motion data [i ]. sxf, &motion_data[i].syf); 

if (i!=0) 

{ 

fgets(line,254,fd); 
sscanf(line,"%d %d %d %d", 

&motion_data[i] .back hor f code, &motion_data[i] .back_vert_f_code, 
&motion_data[i].sxb, &motion_data[i].syb); 

} 

} 

fclose(fd); 

/* make flags boolean (x'=0 -> x=l) *1 
mpegl = Mmpegl; 
fieldpic = ! Ifieldpic; 
low_delay= Mlowdelay; 
constrparms = Hconstrparms; 


prog_seq = !!prog_seq; 
topfirst = ! Itopfirst; 

for (i=0; i<3, i++) 

{ 

frame_pred_dct_tab[i] = !!frame_pred_dct_tabli]; 
conceal_tab[i] = !!conceal_tab[i]; 
qscale_tab[i] = !!qscale_tab[i]; 
intravlc_tab[i] = !!intravlc_tab[i]; 
altscantabji] = ! !altscan_tab[i]; 

} 

repeatfirst = Mrepeatfirst; 
prog_frame = !!prog_frame; 

chan_out_word(M,Link 1 Output); 
chan_out_word(M,Link20utput); 

chan_out_word(nframes,LinklOutput); 

chan_out_word(nframes,Link20utput); 

chan_out_word(chroma_format,Link 1 Output); 
chan_out_word(chroma_format,Link20utput); 


static void init() 

{ 

int i, size; 

static int block_count_tab[3] = {6,8,12}; 
init_fdct(); 

/* round picture dimensions to nearest multiple of 16 or 32 */ 
mbwidth = (horizontal_size+15)/16; 

mb_height = prog_seq ? (vertical_size+15)/16 : 2*((vertical_size+31)/32); 
mb_height2 = fieldpic ? mb_height»l : mb_height; /* for field pictures */ 
width = 16*mb_width; 
height = 16*mb_height; 

chrom width = (chroma_format=CHROMA444) ? width : width»l; 
chrom height = (chroma_format!=CHROMA420) ? height : height»l; 

height2 = fieldpic ? height» 1 : height; 

width2 = fieldpic ? width«l : width; 

chrom_width2 = fieldpic ? chrom_width«l : chrom width; 

blockcount = block_count_tab[chroma_format-l]; 

/* clip table */ 

if (!(clp = (char *)malloc(1024))) 
error(’’malloc failed\n"); 
clp+= 384; 

for (i=-384; i<640; i++) 


clp[i] = (i<0) ? 0 : ((i>255) ? 255 : i); 


for (i=0; i<3; i++) 

{ 

size = (i=0) ? width*height : chrom_width*chrom_height; 

if (!(auxorgframe[i] = (char *)malloc(size))) 
error("malloc failed for auxorgframe\n"); 
if (!(predframe[i] = (char *)malloc(size») 
error("malloc failed for predframe\n"); 

} 

mbinfo = (struct mbinfo *)malloc(mb_width*nib_height2*sizeof(struct mbinfo)); 
if (Imbinfo) error("malloc failed mbinfoVn"); 

blocks = (int (*)[64])malloc(mb_widlh*mb_height2*block_count*sizeof(int [64])); 
if (Iblocks) error("malloc failed for blocks\n"); 

chan_out_word(mb_width,Link 1 Output); 
chan_out_word(mb_width,Link20utput); 

chan_out_word(mb_height,Link 1 Output); 
chan_out_word(mb_height,Link20utput); 

chan_out_word(mb_height2,LinklOutput); 

chan_out_word(mb_height2,Link20utput); 

chan_out_word(width,Link 1 Output); 
chan_out_word(width,Link20utput); 

chan_out_\vord(height,Link 1 Output); 
chan_out_word(height,Link20utput); 

chan_out_word(width2,Link 1 Output), 
chan_out_word(width2,Link20utput); 

chan_out_word(height2,Link 1 Output), 
chan_out_word(height2,Link20utput); 

chan_out_word(chrom_width,Link 1 Output); 
chan_out_word(chrom_width,Link20utput); 

chan_out_word(chrom_height,Link 1 Output); 
chan_out_word(chrom_height,Link20utput); 

chan_out_word(chrom_\vidth2,LinklOutput); 

chan_out_word(chrom_width2,Link20utput); 

chan_out_word(block_count,LinklOutput); 

chan_out_\vord(block_count,Link20utput); 


} 



void putseqO 

{ 

/* this routine assumes (N % M) == 0 */ 
int i,j, f; 

unsigned int a, b, c, p, q; 

FILE *f_mbi, *f_d, *f_status, *f_blk, 
char name[128], namel[128); 
char *neworg[3]; 
int finished ; 

for (j=0;j<3;j-H-) 
neworgtj] = auxorgframe[j]; 

/* loop through all frames in encoding/decoding order */ 
for (i=0; i<nframes; i++) 

{ 

chan_out_word(i,Link 1 Output); 
chan_out_word(i,Lmk20utput); 


/* read in data from the first program */ 
sprintf(name,tplorg,i); 
sprintf(name 1 ,"c:\\nick\\dtrn\\%s.st'',name); 
if (!(f_status = fopen(namel,"r”))) 

{ 

sprintf(errortext,"Couldn't open %s\n",namel); 
error(errortext); 

} 

fscanf(f_status,"%d %d %d ", &f, &pict_struct, &frame_pred_dct); 
fclose(f_status); 

fprintf(f_time,"frame:%d\n", f); 

sprintf(name,tplorg,f+frameO); 

sprintf(namel,"c:\\nick\\dtm\\%sy.nor",name); 
if (!(f_d = fopen(namel,"rb"))) 

{ 

sprintf(errortext, "Couldn't open %s\n",namel); 
error(errortext); 

} 

fread(neworg[0],l,width*height,f_d); 

fclose(fd); 

sprintf(namel,"c:\\nick\\dtm\\%su;nor",name); 
if (!(f_d = fopen(namel,"rb"))) 

{ 

sprintf(errortext, "Couldn’t open %s\n",namel); 
error(errortext); 

} 

fread(neworg[ 1 ], 1 ,chrom_width*chrom_height,f_d); 
fclose(fd). 


sprintf(namel,"c:\\nick\\dtm\\%sv.nor",name); 
if (!(f_d = fopen(namel,"rb"))) 

{ 

sprintf(errortext, "Couldn’t open %s\n",namel); 
error(errortext); 

} 

fread(neworg[2], 1 ,chrom_width*chrom_height,f_d); 
fclose(f_d), 

sprintf(namel,"c:\\nick\\dtm\\%sy.prf',name); 
if (!(f_d = fopen(nantel,”rb"))) 

{ 

sprintf(errortext, "Couldn't open %s\n",namel); 
error(errortext); 

} 

fread(predframe[0], 1 ,width*height,f_d); 
fclose(f_d); 

sprintf(name 1 ,"c:\\nick\\dtm\\%su prf'.name); 
if (!(f_d = fopen(namel,"rt)"))) 

{ 

spnntf(errortext, "Couldn't open %s\n",namel); 
error(errortext); 

} 

fread(predframe[l], 1 ,chrom_vvidth*chrom_height,f_d); 
fclose(f_d); 

sprintf(namel,"c:\\nick\\dtm\\%sv.prf’,nanie); 
if (!(f_d = fopen(namel,"rb"))) 

{ 

sprintf(errortext, "Couldn’t open %s\n",namel); 
error(errortext); 

} 

fTead(predframe[2],l,chrom _width*chrom_height,f_d); 
fclose(f_d); 

chan_out_word(pict_struct,LinklOutput); 

chan_out_word(pict_struct,Link20utput); 

chan_out_word(frame_pred_dct,Link 1 Output); 
chan_out_word(frame_pred_dct,Link20utput); 

chan_out_message(width*height,neworg[0],LinklOutput); 

chan_out_message(width*height,neworg[0],Link2Output). 

chan_out_message(chrom _width*chrom_height,neworg[l],LinklOutput); 
chan_out_message(chrom_width*chrom_height,neworg[l],Link2 Output); 

chan_out_message(chrom_width*chrom_height,neworg[2],LinklOutput); 
chan_out_message(chrom_width*chrom_height,neworg[2],Link2 Output); 

chan_out_message(width*height,predframe[0],LinklOutput); 

chan_out_message(width*height,predframe[0],Link2Output); 


chan_out_message(chrom_width*chrom_height,predframe[ 1 ], Link 1 Output), 
chan_out_message(chrom_width*chrom_height,predframe[ 1 ],Link20utput); 

chan_out_message(chrom_width*chrom_height,predframe[2],LinklOutput); 

chan_out_message(chrom_width*chrom_height,predframe[2],Link20utput); 

/* loop through 36 macroblocks of the picture */ 

transform(predframe,nevvorg,mbinfo, blocks, 1 ); 

for (a=36; a<146; a++) 

{ 

chan_in_word(&mbinfo[a].dct_type,Link 1 Input); 

for (b=0; b<block_count; b++) 
for (c=0; c<64; C++) 

chan_in_word(&blocks[a*block_count+b][c],LinklInput); 

} 

for (a=146; a<256; a++) 

{ 

chan_in_word(&mbinfo[a].dct_type,Link2Input); 

for (b=0; b<block_count; b++) 
for (c=0; c<64; C++) 

chanJn_word(&blocks[a*block_count+b][c],Link2Input); 

} 

/* write data for the third program */ 

sprintf(namel,"c:\\nick\\tm_out\\%s.mbi",name), 
if (!(f_mbi = fopen(namel,"w/0"))) 

{ 

sprintf(errortext,"Couldn’t open %s\n”,namel); 
error(errortext); 

} 

sprintf(namel,"c:\\nick\\tm_out\\%s.blk",name); 
if (!(f_blk = fopen(namel,"w/0"))) 

{ 

sprintf(errortext, "Couldn’t open %s\n",namel); 
error(errortext); 

} 


for (a=0; a<mb_height2 * mb width ; a++) 

{ 

fprintf(f_mbi,"%d ", mbinfo[a].dct_type); 

for (b=0; b<block_count, b++) 

{ 

for (c=0; c<64; C++) 

lprintf(f_blk,"%d ", blocks[a*block_count+b][c]), 


fprintf(f_blk,"\n"); 


} 

} 


fclose(f_mbi), 

fclose(fblk); 


} 

fclose(f_time); 

chan_out_word(i,LinklOutput); 

chan_out_\vord(i,Link20utput); 

chan_in_\vord(&finished,Link 1 Input); 
chan_in_\vord(&finished,Link2Input); 


> 


void dct_type_estimation _ANSI_ARGS_((char *pred, 
char *cur, 

struct mbinfo *mbi)), 

static void add_sub_pred _ANSI_ARGS_((char *pred, 
char *cur, 
int lx, int *blk, 
char add_sub)); 

/* subtract prediction and transform prediction error */ 

/* inverse transform prediction error and add prediction */ 

void transform(char *pred[], 
char *cur[], 
struct mbinfo *mbi, 
int blocks[] [64], 
char forward_invers ) 

{ 

int i, j, il, jl, n, cc, offs, lx; 
int k, ii, jj; 

if (forward Jnvers) 

dct_type_estimation(pred[0], cur[0], mbi ) ; 

/* subtract predicuon and transform prediction error */ 

/* inverse transform prediction error and add prediction */ 

/* loop through 36 macroblocks of the picture */ 

for(k=0; k<36; k++) 

{ 

jj = k»4; 
ii = k - (jj«4); 
j=jj«4; 


i = ii«4; 


for (n=0; n<block_count; n++) 

{ 

cc = (n<4) ? 0 : (n&l)+l, /* color component index */ 
if (cc=0) 

{ 

/* luminance */ 

if ((pict_struct=FRAME_PICTURE) && mbi[k].dct_type) 

{ 

/* field DCT */ 

offs — i + ((n&l)«3) + width*(j+((n&2)» 1)); 
lx = width«l; 

} 

else 

{ 

/* frame DCT */ 

offs = i + ((n&l)«3) + width2*(j+((n&2)«2)); 
lx = width2; 

} 

if (pict_struct=BOTTOM_FIELD) 
offs += width; 

} 

else 

{ 

/* chrominance */ 

/* scale coordinates */ 

il = (chroma_format=CHROMA444) ? 1 : i»l; 
jl = (chroma format ! =CHROMA420) ?j : j»l; 

if ((pict_stnict=FRAME_PICTURE) && mbi[k].dct_type 
&& (chroma_format!=CHROMA420)) 

{ 

/* field DCT */ 

offs = il + (n&8) + chrom_width*(jl+((n&2)»l)); 
lx = chrom_width«l; 

} 

else 

{ 

/•frame DCT*/ 

offs = il + (n&8) + chrom_width2*(jl+((n&2)«2)); 
lx = chrom_width2; 

} 

if (pict_struct=BOTTOM_FIELD) 
offs += chromwidth; 

} 

if (forward invers) /* 1 for forward DCT */ 

{ 

add_sub_pred(pred[cc]+offs, 

cur[cc]+offs, 


lx, 

blocks[k*block_count+n] , 

0 ); 


fdct(blocks[k*block_count+nJ); 

} 

else /* 0 for invers DCT ♦/ 

{ 

/* idct(blocks[k*block_count+n]);*/ 
add_sub_pred(pred[cc]+offs, 
cur[ccj+offs, 
lx, 

blocks[k*block_count+n], 

1 ); 


} 

} 


/* add prediction and prediction error, saturate to 0. .255 */ 
/♦ subtract prediction from block data ♦/ 
static void add_sub_pred(char *pred, 
char *cur, 
int lx, int *blk, 
char add_sub ) 

{ 

chari, j; 

for (j=0;j<8;j++) 

{ 

for (i=0; i<8; i++) 

{ 

if (add_sub) /* 1 for addition */ 
cur[i] = clp[ blk[i] + pred[i] ]; 
else /* 0 for subtractction */ 
blk[i] = cur[i] - pred[i]; 

} 

blk+= 8; 
cur+= lx; 
pred+= lx; 

} 

} 


void dct_type_estimation(char *pred, 
char ♦cur, 

struct mbinfo *mbi ) 

{ 

int blkO[128], blkl[128]; 
int i, j, iO, jO, offs; 
double sO, si, sqO, sql, sOl; 


double d, r; 
int k, ii, jj; 


for(k=0; k<36; k++) 

{ 

jj = k»4; 
ii = k - (ij«4); 

jo = jj«4; 

iO = ii«4; 

if (frame_pred_dct || pict_struct!=FRAME_PICTURE) 
mbi[k].dct_type = 0; 
else 
{ 

for G=0;j<8;j++) 

{ 

offs = width*((j«l)+jO) + iO; 
for (i=0, i<16; i++) 

{ 

blk0[16*j+i] = cur[offs] - p red [offs]; 
blkl[16*j+i] = cur[offs+width] - pred[offs+width], 
offs++; 

} 

} 

/* correlate fields */ 
sO=s 1 =sqO=sq 1 =s0 1=0; 


for (i=0; i<128; i++) 

{ 

sO+= (double) blkO[i]; 
sqO+= (double) (blkO[i]*blkO[i]); 
sl+= (double) blkl [i]; 
sql+= (double) (blkl [i]*blkl [i]); 
s01+= (double) (blk0[i]*blkl[ij); 

} 

s0 = 0.125 *s0; 
si =0.125* si; 

d = (sq0-(s0*s0)/2.0)*(sql-(sl*sl)/2.0); 

if (d>0.0) 

{ 

r = (sO 1 -(sO*s 1 )/2.0)/sqrt(d); 
if (r>0.5) 

mbi[k].dct_type = 0; /* frame DCT */ 
else 

mbi[k].dct_type = 1; /* field DCT */ 

} 

else 

mbi[k].dct_type = 1; /* field DCT */ 

} 

} 

} 




#ifndefPI 

# ifdef M_PI 

if define PI M_PI 
if else 

# define PI 3 14159265358979323846 
if endif 

#endif 


static double c[8] [8]; /* transform coefficients */ 

void init_fdct() 

{ 

char i, j; 
double s; 

for (i=0; i<8; i++) 

{ 

s = (i=0) ? sqrt(0. 125) : 0.5; 

for (j=0; j<8; j++) 

c[i][j] = s * cos((PI/8.0)*i*(j+0.5)); 

} 

} 

void fdct(int *block) 

{ 

char i, j, k; 
double s; 
double tmp[64]; 


for (i=0; i<8; i++) 
for (j=0; j<8; j++) 

{ 

s = 0.0; 

for (k=0; k<8; k++) 
s += c[j][k] * block[8*i+k]; 


tmp[8*i+j] = s; 

} 

for (j=0;j<8;j++) 
for (i=0; i<8; i++) 

{ 

s = 0.0, 

for (k=0; k<8; k++) 
s += c[i][k] * tmp[8*k+jj; 

block[8*i+j] = (int)floor(s+0.499999); 


} 


} 


/********************************/ 


/* 


*/ 

/* 

tOl c residing at PEI 

*/ 

/* 


*/ 


/********************************/ 


#include "c:\tc2v2\dos.h" 

#include "c:\tc2v2\float.h" 

#include "c:\tc2v2\math.h" 

#include "c:\tc2v2\ctype.h" 

#include ''c:\tc2v2\stdio.h" 

#include "c:\tc2v2\stdlib.h" 

#include "c:\tc2v2\string.h" 

#include "c:\tc2v2\chan.h" 

#include "c:\tc2v2\thread.h" 

^include "c:\tc2v2\time.h" 

#include "c:\tc2v2\ermo.h" 

#define GLOBAL /* used by global.h *1 

^include "c:\nick\h\constl.h" 

#include "c:\nick\h\config.h” 

#include "c:\nick\h\global.h" 

static void init _ANSI_ARGS_((void)); 

static void readparmfile _ANSI_ARGS_((char *fname)); 

void main(argc,argv,envp,ins,in_ports,outs,out_ports) 
int argc.ins.outs; 
char *argv[],*envp[]; 

CHAN *in_ports[],*out_ports|]; 

{ 

/* read parameter file */ 
chan_in_\vord(&M,LinkOInput); 
chan_out_word(M,Link 1 Output); 
chan_out_word(M,Link20utput); 

chan_in_word(&nframes,LinkOInput); 
chan_out_word(nframes,Link 1 Output); 
chan_out_word(nfirames,Link20utput); 

chanin _word(&chroma_format,LinkOInput); 

chan_out_word(chroma_format,LinklOutput); 

chan_out_word(chroma_format,Link20utput); 

init(); 

putseqO; 


} 


static void init() 

{ 


int i, size; 


init_fdct(); 

chan_in_word(&mb_width,LinkOInput); 
chan_out_\vord(mb_vvidth,Linkl Output); 
chan_out_\vord(mb__\vidth,Link20utput); 

chan_in_word(&mb_height,LinkOInput); 
chan_out_word(mb_height,Link 1 Output); 
chan_out_word(mb_height,Link20utput); 

chan_in_word(&mb_heighl2,LinkOInput); 
chan_out_word(mb_height2,LinklOutput); 
chan_out_word(mb_height2,Link2 Output); 

chan_in_word(&width,LmkOInput); 
chan_out_word(width,Link 1 Output), 
chan_out_word(width,Link20utput); 

chanJn_\vord(&height,LinkOInput); 
chan_out_word(height,Link 1 Output) ; 
chan_out_word(height,Link20utput); 

chan_in_word(&\vidth2,LinkOInput); 
chan_out_word(width2,Link 1 Output); 
chan_outjword(width2,Link20utput); 

chan_in_word(&height2,LinkOInput); 

chan_out__word(height2,LinklOutput); 

chan_out_word(height2,Link20utput); 

chan_in_\vord(&chrom_vvidth,LiiikOInput); 
chan_out_word(chrom_\vidth,LinklOutput); 
chan_out_word(chrom_width,Link2 Output); 

chan_in_word(&chrom_height,LinkOInput); 
chan_out _word(chrom_height. Link 1 Output) ; 
chan_out_word(chrom_height,Link20utput); 

chan_in_word(&chrom_width2,LinkOInput); 
chan_out_word(chrom_width2,Link 1 Output); 
chan_out_word(chrom_width2,Link20utput); 

chan_injword(&block_count,Link01nput); 
chan_out_word(block_count,Link 1 Output); 
chan_out_word(block_count,Link20utput); 

/* clip table */ 

clp = (char *)malloc(1024); 

clp+= 384; 

for (i=-384; K640; i++) 
clp[i] = (i<0) ? 0 : ((i>?55) ? 255 : i); 


for (i=0, i<3; i++) 

{ 

size = (i=0) ? width*height . chrom_width ,| ‘chrom_height; 

auxorgframe[i] = (char *)malloc(size); 
predframe[i] = (char *)malloc(size); 

} 

mbinfo = (struct mbinfo *)malloc(l 10*sizeof(struct mbinfo)); 
blocks = (int (*)[64])malloc(110*block_count*sizeof(int [64])), 


} 


void putseqO 

{ 

/* this routine assumes (N % M) = 0 */ 
int i,j, f; 

unsigned int a, b, c; 
char *neworg[3]; 
int loop; 

for (j=0; j<3; j++) 
neworg[j] = auxorgffame[j]; 


/* loop through all frames in encoding/decoding order */ 
do{ 

chan_in_word(&loop,LinkOInput); 

chan_out_word(loop,LinklOutput), 

chan_out_word(Ioop,Link20utput); 

if(loop=nframes) break; 

chan_in_word(&pict_struct,LinkOInput); 

chan_out_word(pict_struct,LinklOutput); 

chan_out_word(pict_struct,Link20utput); 

chan_in _word(&ffame_pred_dct,LinkOInput); 

chan_out_word(frame_pred_dct,LinklOutput); 

chan_out_word(frame_pred_dct,Link20utput); 

chan_in_message(width*height,neworg[0],LinkOInput); 

chan_out_message(width*height,neworg[0],LinklOutput); 

chan_out_message(width*height,neworg[0],Link20utput); 

chan_in_message(chrom _width*chrom_height,neworg[l],LinkOInput); 
chan_out_message(chrom_width*chrom_height,neworg[l],LinklOutput); 
chan_out_message(chrom_width*chrom_height,neworg[ 1 ],Link20utput); 

chan_in_message(chrom_width*chrom_height,neworg[2],LinkOInput); 

chan_out_message(chrpm_\vidth*chrom_height,neworg[2],LinklOutput); 

chan_out_message(chrom_width*chrom_height,neworg[2],Link20utput); 


chanin _message(width*height,predframe[0],LinkOInput); 

chan_out_message(\vidth*height,predframe[0],LinklOutput); 

chan_out_message(width' |, height,predframe[0],Link2Output), 

chan_in_message(chrom_width*chrom_height,predframe[ 1 ] .LinkOInput); 

chan_out_message(chrom_width*chrom_height,predframe[l],LinklOutput), 

chan_out_message(chrom_\vidth*chrom_height,predframe[l],Link20utput); 

chan_in_message(chrom_width*chrom_height,predfraine[2], LinkOInput); 

chan_out_message(chrom_width*chrom_height,predfraine[2],LinklOutput); 

chan_out_message(chroni_width*chrom_height,predframe[2],Link20utput); 

/* loop through 36 macroblocks of the picture */ 

transform(predframe,neworg,mbinfo, blocks, 1 ); 

for (a=36; a<73; a++) 

{ 

chanin _word(&mbinfo[a] .dcttype.Link 1 Input); 

for (b=0; b<block_count; b++) 
for (c=0; c<64; c++) 

chan_in_word(&blocks[a*block_count+b][c],LinklInput), 

} 

for (a=73; a<l 10; a++) 

{ 

chan_in_word(&mbinfo[a].dct_type,Link2Input); 

for (b=0; b<block_count; b++) 
for (c=0, c<64; c++) 

chan_in_word(&blocks[a*block_count+b][c],Link2Input); 

} 

for (a=0, a<l 10; a++) 

{ 

chan_out_word(mbinfo[a].dct_type,LinkOOutput); 

for (b=0, b<block_count; b++) 
for (c=0; c<64; c++) 

chan_out_word(blocks[a*block_count+b][c],LinkOOutput); 

} 

}while(loop<nframes); 

chan_in_word(&loop,Link 1 Input), 
chan_in_\vord(&loop,Link2Input); 
chan_out_word(loop,LinkOOutput); 

}/*end of main()*/ 

void dct_type_estimation _ANSI_ARGS_((char *pred, 
char *cur, 



struct mbinfo *mbi)); 


static void add_sub_pred _ANSI_ARGS_((char *pred, 
char *cur, 
int lx, int *blk, 
char add_sub)); 

/* subtract prediction and transform prediction error */ 

/* inverse transform prediction error and add prediction */ 

void transform(char *pred[], 
char *cur[], 
struct mbinfo *mbi, 
int blocks[] [64], 
char forward_invers ) 

{ 

inti, j, il, jl, n, cc, offs, lx; 
int k, ii, jtf; 


if (forward_invers) 

dct_type_estimation(pred[0], cur[0], mbi ) ; 

/* subtract prediction and transform prediction error */ 

/* inverse transform prediction error and add prediction */ 

/* loop through 36 macroblocks of the picture */ 

for(k=0; k<36; k++) 

{ 

jj = (k+36)»4; 
ii = (k+36) - (jj«4); 
j=jj«4; 
i = ii«4; 

for (n=0; n<block_count; n++) 

{ 

cc = (n<4) ? 0 : (n&l)+l ; /* color component index */ 
if (cc=0) 

{ 

/* luminance */ 

if ((pict_struct=FRAME_PICTURE) && mbi[k].dct_type) 

{ 

I* field DCT */ 

offs = i + ((n&l)«3) + width*(j+((n&2)»l)); 
lx = width«l; 

} 

else 

{ 

/* frame DCT */ 

offs = i + ((n&l)«3) + width2*(j+((n&2)«2)); 
lx = width2; 

} 


if (pict_stmct=BOTTOM_FIELD) 


offs += width; 

} 

else 

{ 

/* chrominance */ 

/* scale coordinates */ 

il = (chroma_format==CHROMA444) ? 1 : i»l; 
jl = (chroma_format!=CHROMA420) ? j : j»l; 

if ((pict_struct==FRAME_PICTURE) && mbi[k].dct_type 
&& (chroma format ! =CHROMA420)) 

{ 

/* field DCT */ 

offs = il + (n&8) + chrom_width*(jl+((n&2)»l)); 
lx = chrom_width«l; 

} 

else 

{ 

/* frame DCT */ 

offs = il + (n&8) + chrom_width2*(jl+((n&2)«2)); 
lx = chrom_width2; 

} 

if (pict_struct=BOTTOM_FIELD) 
offs += chromwidth; 

} 

if (forward_invers) /* 1 for forward DCT */ 

{ 

add_sub_pred(pred[cc]+offs, 

cur[cc]+offs, 

lx, 

blocks[k*block_count+n], 

0 ); 

fdct(blocks[k*block_count+n]); 

} 

else /* 0 for invers DCT *1 

{ 

/* idct(blocks[k*block_count+n]);*/ 
add_sub_pred(pred [ccj+offs, 
cur[cc]+offs, 
lx, 

blocks(k*block_count+n], 

1 ), 

} 

} 

} 

} 


/* add prediction and prediction error, saturate to 0...255 */ 
/* subtract prediction from block data *1 



static void add_sub_pred(char *pred, 
char ♦cur, 
int lx, int *blk, 
char add_sub ) 

{ 

char ij; 

for 0=O;j < 8;j++) 

{ 

for (i=0; i<8; i++) 

{ 

if (add_sub) /* 1 for addition */ 
cur[i] = clp[ blk[i] + pred[i] ]; 
else /* 0 for subtractction */ 
blk[i] = cur[i] - pred[i]; 

} 

blk+= 8; 
cur+= lx; 
pred+= lx; 

} 

} 


void dct_type_estimation(char *pred, 
char *cur, 

struct mbinfo *mbi ) 

{ 

int blk0[128], blkl [ 128]; 
int i, j, iO, jO, offs; 
double sO, si, sqO, sql, sOl; 
double d, r; 
int k, ii, jj; 

for(k=0; k<36; k++) 

{ 

jj = (k+36)»4; 
ii = (k+36) - (ij«4); 
jO =jj«4; 
iO = ii«4; 

if (ffame_pred_dct || pict_struct!=FRAME_PICTURE) 
mbi[k].dct_type = 0; 
else 
{ 

for (j=0;j<8;j++) 

{ 

offs = width*((j«l)+jO) + iO; 
for (i=0; i<16; i++) 

{ 

blk0[16*j+i] = curfoffs] - pred[offs]; 
blkl[16*j+i] = cur[offs+width] - pred[offs+width]; 
offs++; 

} 


} 

/* correlate fields */ 
sO=s 1 =sqO=sq 1 =s0 1=0; 

for (i=0; i<128; i++) 

{ 

s0+= (double) blkO[i]; 
sqO+= (double) (blkO[i]*blkO[i]); 
sl+= (double) blkl [i]; 
sql+= (double) (blkl [i]*blkl [i]); 
s01+= (double) (blkO[i]*blkl[i]); 

} 

s0 = 0.125 * sO; 
si =0.125 * si; 

d = (sq0-(s0*s0)/2.0)*(sql-(sl*sl)/2.0); 

if (d>0.0) 

{ 

r = (s01-(s0*sl)/2.0)/sqrt(d); 
if (r>0.5) 

mbi[k].dct_type = 0; /* frame DCT */ 
else 

mbi[k].dct_type = 1; /* field DCT */ 

} 

else 

mbi[k].dct_type = 1; /* field DCT */ 

} 

} 


#ifndef PI 

# ifdefM_PI 

# define PI M_PI 

# else 

# define PI 3.14159265358979323846 

# endif 
#endif 


static double c[8] [8]; /* transform coefficients */ 

void init_fdct() 

{ 

char i, j, 
double s; 

for (i=0; i<8; i++) 

{ 

s = (i=0) ? sqrt(0.125) : 0.5; 

for (j=0;j<8;j++) 

c[i][j] = s * cos((PI/8.0)*i*(j+0.5)); 


} 

} 

void fdct(int *bIock) 

{ 

chari.j, k; 
double s; 
double tmp[64]; 


for (i=0; i<8; i++) 
for G=0;j<8;j++) 

{ 

s = 0.0; 

for (k=0; k<8; k++) 
s += c[j][k] * block[8*i+k]; 


tmp[8*i+j] = s; 

} 

for o=0;j<8;j++) 
for (i=0; i<8; i++) 

{ 

s = 0.0; 

for (k=0; k<8; k++) 
s += c[i][k] * tmp[8*k+j]; 

block[8*i+j] = (int)floor(s+0.499999); 


} 


} 


/ft******************************/ 
/♦ */ 

/* tOll.c residing at PE2 */ 
/* */ 

/********************************/ 


^include "c:\tc2v2\dos.h" 
#include "c:\tc2v2\float.h" 
#include "c:\tc2v2\math.h" 
include "c:\tc2v2\ctype.h" 
^include "c:\tc2v2\stdio.h" 
#include "c:\tc2v2\stdlib.h" 
#include "c:\tc2v2\string.h" 
#include "c \tc2v2\chan.h" 
#include "c.\tc2v2\thread.h" 
#include "c:\tc2v2\time.h" 
^include "c:\tc2v2\ermo.h" 


#define GLOBAL /* used by global.h */ 

^include "c:\nick\h\constl.h" 

#include "c:\nick\h\config.h" 

^include "c:\nick\h\global.h" 

I* private prototypes */ 

static void init _ANSI_ARGS_((void)); 

static void readparmfile _ANSI_ARGS_((char *fname)), 

void main(argc,argv,envp,ins,in_ports, outs, out _ports) 
int argc.ins.outs; 
char *argv[],*envp[]; 

CHAN *in_ports[],*outjports[]; 

{ 

/* read parameter file */ 
chan_in_word(&M,LinkOInput); 
chan_in_word(&nframes,LinkOInput); 
chan_in_word(&chroma_format,LinkOInput); 

init(); 

putseqO; 


static void init() 

{ 

int i, size; 
init_fdct(); 

chan_in_word(&mb_width,LinkOInput); 

chan_m_word(&mb_height,LinkOInput); 

chanJn_word(&mb_height2,LinkOInput); 

chan_in_word(&width,LinkOInput); 


chan_in_word(&height,LinkOInput), 

chan_in_word(&width2,LinkOInput); 

chan_in_word(&height2,LinkOInput); 

chan_in_word(&chrom_width,LinkOInput); 

chan_in_word(&chrom_height,LinkOInput); 

chan_in_word(&chrom_width2,LinkOInput); 

chan_in_word(&block_count,LinkOInput); 

/* clip table */ 

clp = (char. *)malloc( 1024); 

clp+= 384; 

for (i=-384; i<640; i++) 
clp[i] = (i<0) ? 0 : ((i>255) ? 255 • i); 

for (i=0, i<3; i++) 

{ 

size = (i=0) ? width*height : chrom_width*chrom_height; 

auxorgframe[i] = (char *)malloc(size); 
predframe[i] = (char *)maIIoc(size); 

} 

mbinfo = (struct mbinfo *)malloc(37*sizeof( struct mbmfo)); 
blocks = (int (*)[64])malloc(37*block_count*sizeof(int [64])); 

} 


void putseqO 

{ 

/* this routine assumes (N % M) == 0 */ 
int i, j, f; 

unsigned int a, b, c; 
char *ne\vorg[3], 
int loop; 

for (j=0, j<3; j++) 
neworg[j] = auxorgframe[j]; 


/* loop through all frames in encoding/decoding order */ 
do{ 

chan_in_word(&loop,LinkOInput); 

if(loop=nframes) break; 

chan_in_word(&pict_struct > LmkOInput), 

chan_in_word(&frame_pred_dct,LinkOInput); 

chan_in_message(width*height,neworg[0],Link0Input); 

chan_in_message(chrom_width*chrom_height,neworg[ 1 ],LinkOInput); 

chan_in_message(chrom_width*chrom_height,neworg[2],LinkOInput); 

chan_in_message(width*height,predframe[0],LinkOInput); 

chan_in_message(chrom_width*chrom_height,predframe[l],LinkOInput); 

chan_in_message(chrom_width*chrom_height,predfirame[2],Link0Input); 


/* loop through 37 macroblocks of the picture */ 
transform(predframe,ne\vorg,mbinfo,blocks,l); 


for (a=0; a<37; a++) 

{ 

chan_out_word(mbinfo(a].dct_type,LinkOOutput); 

for (b=0; b<block_count; b++) 
for (c=0; c<64; C++) 

chan_out_word(blocks|a*block_count+b][c],LinkOOutput); 

} 

}while(loop<nframes); 

chan_out_word(loop,LinkOOutput); 

}/*end of main()*/ 

void dct_type_estimation _ANSI_ARGS_((char *pred, 
char *cur, 

struct mbinfo *mbi»; 

static void add_sub_pred _ANSI_ARGS_((char *pred, 
char *cur, 
int lx, int *blk, 
char addsub)), 

/* subtract prediction and transform prediction error */ 

/* inverse transform prediction error and add prediction */ 

void transform(char *pred[J, 
char *cur[], 
struct mbinfo *mbi, 
int blocks[][64], 
char forward_invers ) 

{ 

int i, j, il, jl, n, cc, offs, lx; 
int k, ii, jj; 

if (forwardjnvers) 

dct_type_estimation(pred[0], cur(0], mbi ) ; 

/* subtract prediction and transform prediction error */ 

/* inverse transform prediction error and add prediction */ 

/* loop through 37 macroblocks of the picture */ 

for(k=0, k<37; k++) 

{ 

jj = (k+72)»4; 
ii = (k+72) - (ij«4); 
j =jj«4; 
i = ii«4; 


for (n=0; n<block_count; n++) 

{ 

cc = (n<4) ? 0 : (n&l)+l; /* color component index */ 
if (cc=0) 

{ 

/* luminance */ 

if ((pict_struct=FRAME_PICTURE) && mbi[k].dct_type) 

{ 

/* field DCT */ 

offs = i + ((n&l)«3) + width*(j+((n&2)»l)); 
lx = width«l; 

} 

else 

{ 

/* frame DCT */ 

offs = i + ((n&l)«3) + width2*(j+((n&2)«2)); 
lx = width2; 

} 

if (pict_struct=BOTTOM_FIELD) 
offs += width; 

} 

else 

{ 

/* chrominance */ 

/* scale coordinates */ 

il = (chroma_format=CHROMA444) ? i : i»l; 
jl = (chroma_format!=CHROMA420) ?j : j»l; 

if ((pict_struct=FRAME_PICTURE) && mbi[k].dct_type 
&& (chroma format ! =CHROMA420)) 

{ 

/* field DCT */ 

offs = il + (n&8) + chrom_width*(jl+((n&2)»l)); 
lx = chrom_width«l; 

} 

else 

{ 

/* frame DCT */ 

offs = il + (n&8) + chrom_width2*(jl+((n&2)«2)); 
lx = chrom_width2; 

} 

if (pict_struct=BOTTOM_FIELD) 
offs += chrom_width; 

} 

if (forward_invers) /* 1 for forward DCT */ 

{ 

add_sub_pred(pred[cc]+offs, 

cur[cc]+ofifs, 

lx, 

blocks [k*block_count+n], 


0 ); 


fdct(blocks[k*block_count+n]); 

} 

else /* 0 for invers DCT */ 


{ 

/* idct(blocks[k*block_count+n]);*/ 
add_sub_pred (pred [cc] +offs, 
cur[cc]+offs, 
lx, 

blocks[k*block_count+n], 

i); 

} 


} 

} 


} 


/* add prediction and prediction error, saturate to 0...255 */ 
/* subtract prediction from block data *1 
static void add_sub__pred(char *pred, 
char *cur, 
int lx, int *blk, 
char addsub ) 

{ 

char i, j; 

for (j=°; j <8 i j +H ‘) 

{ 

for (i=0; i<8; i++) 

{ 

if (add_sub) /* 1 for addition */ 
cur[i] = clp[ blk[i] + pred[i] ]; 
else f* 0 for subtractction */ 
blk[i] = cur[i] - pred[i]; 

} 

blk+= 8; 
cur+= lx; 
pred+= lx; 

} 


void dct_type_estimation(char *pred, 
char *cur, 

struct mbinfo *mbi ) 

{ 

int blk0[128], blkl[128]; 
int i, j, iO, jO, offs; 
double sO, si, sqO, sql, sOl; 
double d, r; 
int k, ii, jj; 


for(k=0; k<37; k++) 

{ 

ij = (k+72)»4, 
ii = (k+72) - Cij«4); 
jO = jj«4; 
iO = ii«4; 

if (frame_pred_dct || pict_struct!=FRAME_PICTURE) 
mbi[k].dct_type = 0; 
else 
{ 

for (j=0;j<8;j++) 

{ 

offs = wtdth*((j«l)+jO) + iO; 
for (i=0; i< 1 6; i++) 

{ 

blk0[16*j+i] = cur[offs] - pred[offs]; 
blkl[16*j+i] = cur[ofifs+width] - pred[offs+width]; 
offs++; 

} 

} 

/* correlate fields */ 
s0=s 1 =sq0=sq 1 =s0 1=0; 

for (i=0, i<128; i++) 

{ 

s0+= (double) blkO[i]; 
sqO+= (double) (blkO[i]*blkO[i]); 
sl+= (double) blkl[i]; 
sql+= (double) (blkl[i]*blkl[i]), 
s01+= (double) (blkO[i]*blkl[i]); 

} 

s0 = 0.125 *s0; 
si =0.125 * si; 

d = (sq0-(s0*s0)/2.0)*(sq 1 -(s 1 *s 1 )/2.0); 

if (d>0.0) 

{ 

r = (s01-(s0*sl)/2.0)/sqrt(d); 
if (r>0.5) 

mbi[k].dct_type = 0; /* frame DCT */ 
else 

mbi[k].dct_type = 1, /* field DCT */ 

} 

else 

mbi[k] dct_type = 1; /* field DCT *1 

} 

} 

} 


#ifndef PI 


# ifdef M_PI 

# define PI M_PI 

# else 


# define PI 3.14159265358979323846 

# endif 
#endif 


static double c[8][8]; /* transform coefficients */ 

void init fdct() 

{ 

char i, j; 
double s; 

for (i=0, i<8; i++) 

{ 

s = (i=0) ? sqrt(0. 125) : 0.5; 

for (j=0; j<8;j++) 

c[i][j] = s * cos((PI/8.0)*i*(j+0.5)); 

} 

} 

void fdct(int ’block) 

{ 

char i,j, k; 
double s; 
double tmp[64]; 

for (i=0; i<8; i++) 
for O=0;j<8;j++) 

{ 

s = 0.0; 

for (k=0, k<8; k++) 
s += C |j][k] * block[8*i+k]; 

tmp[8*i+j] = s; 

} 

for G=o,j<8;j++) 
for (i=0; i<8; i++) 

{ 

s = 0.0; 

for (k=0, k<8; k++) 

S += c[i][k] * tmp[8*k+j]; 

block[8*i+j] = (int)floor(s+0.499999); 


} 


} 



/********************************/ 
/* */ 

/* tO 12.c residing at PE3 */ 
/* */ 

/********************************/ 


#include "c:\tc2v2\dos.h" 
#include ''c:\tc2v2\float.h" 
#include "c:\tc2v2\math.h" 
#include "c:\tc2v2\ctype.h'' 
^include "c:\tc2v2\stdio.h” 
^include "c:\tc2v2\stdlib.h" 
^include "c:\tc2v2\string.h" 
#include "c:\tc2v2\chan.h" 
#include "c:\tc2v2\thread.h" 
#include "c:\tc2v2\time.h" 
^include "c:\tc2v2\ermo.h" 


#defme GLOBAL /* used by global.h */ 

#include "c:\nick\h\constl.h” 

#include "c:\nick\h\config.h” 

#include "c:\nick\h\global h" 

static void init _ANSI_ARGS_((void)); 

static void readparmfile _ANSI_ARGS_((char *fname)); 

void main(argc,argv,envp, ins, in_ports, outs, out_ports) 
int argc,ins,outs; 
char *argv[],*envp[]; 

CHAN * i n_ports [] , *out_ports [] ; 

{ 

/* read parameter file */ 
chan_in_word(&M,LinkOInput); 
chan_in_word(&nframes,LinkOInput); 
chan_in_\vord(&chroma_format,LinkOInput); 

init(); 

putseqO; 


static void init() 

{ 

int i, size; 
init_fdct(); 

chan in word(&mb_vvidth,LinkOInput); 

chan_in_word(&mb_height,LinkOInput); 

chan_in_word(&mb_height2,Link01nput); 

chan_in_word(&width,LinkOInput); 

chan_in_word(&height,LinkOInput); 


chan_in_word(&\vidth2,LinkOInput); 

chan_in_word(&height2,LinkOInput); 

chan_in_\vord(&chrom_\vidth,LinkOInput); 

chan_in_word(&chrom_height,LinkOInput); 

chan_in_word(&chrom_\vidth2,LinkOInpul); 

chan_in_word(&block_count,LinkOInput), 

/* clip table */ 

clp = (char *)malloc(1024); 

clp+= 384; 

for (i=-384; i<640; i++) 
clp[i] = (i<0) ? 0 : ((i>255) ? 255 : i); 

for (i=0; i<3; i++) 

{ 

size = (i==0) ? width*height : chrom_width*chrom_height; 

auxorgframefi] = (char *)maJIoc(size); 
predffame[i] = (char *)malloc(size); 

} 

mbinfo = (struct mbinfo *)malloc(37*sizeof( struct mbinfo)); 
blocks = (int (*)[64])malloc(37*block_count*sizeof(int [64])); 


void putseqO 

{ 

/* this routine assumes (N % M) = 0 */ 
•nt i, j, f; 

unsigned int a, b, c; 
char *neworg[3]; 
int loop; 


for (j=0;j<3;j++) 
neworgU] = auxorgframe[j]; 


/* loop through all frames in encoding/decoding order */ 
do{ 

chan_in_word(&loop,LinkOInput); 

if(loop=nframes) break; 

chan_in_word(&pict_struct,LinkOInput); 

chan_in_word(&frame_pred_dct,LinkOInput); 

chan_in_message(width*height,neworg[0],LinkOInput); 

chan_in_message(chrom_width*chrom_height,neworg[l],LinkOInput); 

chan_in_message(chrom_width*chrom_height,neworg[2],Link01nput); 

chan_in_message(width' , 'height,predframe[0],LinkOInput); 

chan_in_message(chrom_width’ l 'chrom_height,predframe[ 1 ],LinkOInput); 

chan_in_message(chrom_width*chrom_height,predffame[2],LinkOInput); 


/* loop through 37 macroblocks of the picture */ 

transforiTi(predframe,ne\vorg.mbinfo. blocks. 1 ); 

for (a=0, a<V7, a++) 

{ 

chan_oUt_word(mbinfo[a].dct_type,LinkOOutput); 

for (b=0; b<block_count; b++) 
for (c=0; c<64; C++) 

chan_oUt_word(blocks[a*block_count+b][c],LinkOOutput); 

} 

}while(loop<nframes); 

chan_out_word(loop,LmkOOutput); 

}/*end of main()*/ 

void dct_typC_estimation _ANSI_ARGS_((char *pred, 
char *cur, 

struct mbinfo *mbi)), 

static void acld_sub_pred _ANSI_ARGS_((char *pred, 
char *cur, 
int lx, int *blk, 
char add_sub)); 

/* subtract prediction and transform prediction error *1 
/* inverse transform prediction error and add prediction */ 

void transform(char *pred[], 
char *cur[], 
stnrct mbinfo *mbi, 
int blocks[] [64], 
char forvvardjnvers ) 

{ 

int i, j, il, jl, n, cc, ofifs, lx; 
intk, ii, jj; 

if (forward.invers) 

dct_type_estimation(pred[0], cur[0], mbi ) ; 

/* subtract prediction and transform prediction error */ 

/* inverse transform prediction error and add prediction */ 

/* loop through 37 macroblocks of the picture */ 


for(k=0; k<37; k++) 

{ 

jj = (k+l09)»4; 
ii = (k+109) - (ij«4); 
j =ii«4, 
i = ii«4; 


for (n=0; ncblockcount; n++) 

{ 

cc = (n<4) ? 0 : (n&l)+l; /* color component index *1 
if (cc==0) 

{ 

/* luminance */ 

if ((pict struct=FRAME_PI CTURE) && mbi[k].dct_type) 

{ 

/* field DCT */ 

ofifs = i + ((n&l)«3) + width*(j+((n&2)»l)); 
lx = width«l; 

} 

else 

{ 

/* frame DCT */ 

ofifs = i + ((n&l)«3) + width2*(j+((n&2)«2)); 
lx = \vidth2; 

} 

if (pict_struct==BOTTOM_FIELD) 
ofifs += width; 

} 

else 

{ 

/* chrominance */ 

/* scale coordinates */ 

il = (chroma_format=CHROMA444) ? i : i»l; 
jl = (chroma_format!=CHROMA420) ? j : j»l; 

if ((pict_struct=FRAME_PICTURE) && mbi[k].dct_type 
&& (chroma_format!=CHROMA420)) 

{ 

/* field DCT */ 

ofifs = il + (n&8) + chrom_width*(jl+((n&2)»l)); 
lx = chrom_width«l; 

} 

else 

{ 

/* frame DCT */ 

ofifs = il + (n&8) + chrom_width2*(jl+((n&2)«2)); 
lx = chrom_width2; 

} 

if (pict_struct=BOTTOM_FIELD) 
ofifs += chromwidth, 

} 

if (forward_invers) /* 1 for forward DCT */ 

{ 

add_sub_pred(pred[cc]+ofifs, 

cur[cc]+ofifs, 

lx. 


blocks [k*block_count+n], 

0 ); 


fdct(blocks[k*block_count+n]); 

} 

else /* 0 for invers DCT */ 

{ 

/* idct(blocks[k*block_count+n]);*/ 
add_sub_pred(pred[cc]+offs, 
cur[ccj+offs, 
lx, 

blocks(k*block_count+n] , 

i); 

} 

} 

} 

} 


/* add prediction and prediction error, saturate to 0...255 */ 
/* subtract prediction from block data */ 
static void add_sub_pred(char *pred, 
char ♦cur, 
int lx, int *blk, 
char addsub ) 

{ 

char i, j; 

for j<8; j++) 

{ 

for (i=0; i<8; i++) 

{ 

if (add_sub) /* 1 for addition */ 
cur[i] = clp[ blk[i] + predfi] ]; 
else /* 0 for subtractction */ 
blk[i] = cur[i] - pred[i]; 

} 

blk+= 8; 
cur+= lx; 
pred+= lx; 

} 

} 


void dct_type_estimation(char *pred, 
char *cur, 

struct mbinfo *mbi ) 

{ 

int blk0[128], blkl[128]; 
int i, j, iOJO.offs; 
double sO, si, sqO, sql, sOl; 
double d, r; 


int k, ii, jj; 


for(k=0; k<37; k++) 

{ 

jj = (k+109)»4; 
ii = (k+109) - (jj«4); 
jO=jj«4, 
iO = ii«4; 

if (framej»red_dct || pict_struct!=FRAME_PICTURE) 
mbi[k].dct_type = 0; 
else 
{ 

for (j=0; j<8; j++) 

{ 

offs = width*((j«l)+jO) + iO; 
for (i=0; i<16; i++) 

{ 

blk0[16*j+i] = cur[offs] - pred[offs]; 
blkl[16*j+i] = cur[offs+width] - pred[offs+width]; 
offs++; 

} 

} 

I* correlate fields */ 
sO=s 1 =sqO=sq 1 =s0 1 =0; 


for (i=0; i<128; i++) 

{ 

s0+= (double) blkOfi]; 
sqO+= (double) (blkO[i]*blkO[i]); 
sl+= (double) blklpj; 
sql+= (double) (blkl[i]*blkl[i]); 
s01+= (double) (blkO[i]*blkl[i]); 

} 

s0 = 0.125 *s0; 
si = 0.125* si; 

d = (sq0-(s0*s0)/2.0)*(sql-(sl*sl)/2.0); 

if (d>0.0) 

{ 

r = (s01-(s0*sl)/2.0)/sqrt(d); 
if (r>0.5) 

mbi[k],dct_type = 0; /* frame DCT *1 
else 

mbi[k],dct_type = 1; /* field DCT */ 

} 

else 

mbi[k].dct_type = 1; /* field DCT */ 

} 

} 


#ifndef PI 

# ifdef M_PI 

# define PI M_PI 

# else 

# define PI 3.14159265358979323846 

# endif 
#endif 


static double c[8][8]; /* transform coefficients *1 

void mit_fdct() 

{ 

char i, j; 
double s; 

for (i=0; i<8; i++) 

{ 

s = (i==0) ? sqrt(0.125) : 0.5; 


for G=0;j<8;j++) 

c[i][j] = s * cos((PI/8 0)*i*(j+0 5)); 

} 

} 

void fdct(int *block) 

{ 

char i, j, k; 

double s; 

double tmp[64], 

for (i=0; i<8; i++) 
for G=0,j < 8;j++) 

{ 

s = 0.0; 

for (k=0; k<8; k++) 
s += c[j][k] * block[8*i+k]; 


tmp[8*i+j] = s; 

} 

forG = 0;j < 8;j- H -) 

for (i=0; i<8, i++) 

{ 

s = 0.0, 


for (k=0; k<8; k++) 
s += c[i][k] * tmp[8*k+j]; 

block[8*i+j] = (int)floor(s+0.499999); 


} 


} 


/********************************/ 
/* */ 

/* t02.c residing at PE4 */ 

/* */ 

/********************************/ 


^include "c:\tc2v2\dos.h" 

#include "c:\tc2v2\float.h" 

#include "c:\tc2v2\math.h" 

#include "c:\tc2v2\ctype.h" 

^include "c:\tc2v2\stdio.h" 

^include "c:\tc2v2\stdlib.h" 

#include "c:\tc2v2\string.h" 

#include "c:\tc2v2\chan.h" 

#include "c:\tc2v2\thread.h" 

#include "c:\tc2v2\time.h" ■ 

#include "c\tc2v2\ermo.h" 

#defme GLOBAL /* used by global.h */ 

#include "c:\nick\h\constl.h" 

#include "c:\nick\h\config.h" 

#include "c:\nick\h\global.h" 

static void init _ANSI_ARGS_((void)); 

static void readparmfile _ANSI_ARGS_((char *fname)); 

void main(argc,argv,envp, ins, in_ports, outs, out jports) 
int argc, ins, outs; 
char *argv[],*envp[]; 

CHAN *in_ports[],*out_ports[]; 

{ 

/* read parameter file */ 
chan_in_word(&M,LinkOInput); 
chan_out_word(M,Link 1 Output); 
chan_out_word(M,Link20utput); 

chan_in_word(&nfjrames,LinkOInput); 

chan_out_word(nframes,LinklOutput); 

chan_out_word(nframes,Link20utput); 

chan_in_word(&chroma_fonnat,LinkOInput); 
chan_out_word(chrorna_format,Link 1 Output); 
chan_out_word(chroma_format,Link20utput); 

init(); 

putseqO; 


} 


static void init() 

{ 

int i, size; 


init_fdct(); 

chanjn_word(&mb_width,LinkOInput); 
chan_out_word(mb_width,Link 1 Output), 
chan_out_word(mb width, Link20utput); 

chan_in_word(&mb_height,LinkOInput); 

chan_out_word(mb_height,LinklOutput), 

chan_out_word(mb_height,Link20utput); 

chan_in_word(&mb_height2,LinkOInput); 
chan_out_word(mb_height2,Link 1 Output); 
chanjmt_word(mb_height2,Link20utput); 

chan_in_word(&width,LinkOInput); 
chanj>ut_word(width,Link lOutput); 
chan_out_word(width,Link20utput); 

chan_in_word(&height,LinkOInput); 
chan_out_word(height,Link 1 Output); 
chan_out _word(height,Link20utput); 

chan_in_word(&width2,LinkOInput); 
chan_out_word( width2,Link 1 Output); 
chan_out_word(width2,Link20utput); 

chan_in_word(&height2,LinkOInput); 
chan_out_word(height2, Link lOutput); 
chan_out_word(height2,Link20utput); 

chanjn_word(&chrom_width,LinkOInput); 
chan_out_word(chrom_width,Link 1 Output); 
chan_out_word(chrom_width,Link20utput); 

chan_in_word(&chrom_height,LinkOInput); 
chan_out_word(chrom_height,Link 1 Output); 
chan_out_word(chrom_height,Link20utput); 

chan_in_word(&chrom_width2,LinkOInput); 
chan_out_word(chrom_width2 , Link 1 Output) ; 
chan_out_word(chrom_width2,Link20utput), 

chan_in_word(&block_count,LinkOInput), 
chan_out_word(block_count, Link lOutput); 
chan_out_word(block_count,Link20utput); 

/* clip table */ 

clp = (char *)malloc(1024); 

clp+= 384; 

for (i=-384; i<640; i++) 
clpti] = (i<0) ? 0 : ((i>255) ? 255 : i); 


for (i=0; i<3; i++) 


{ 

size = (i=0) ? width*height : chrom_width*chrom_height; 

auxorgframefi] = (char *)malloc(size); 
predframe[i] = (char *)malloc(size); 

} 

mbinfo = (struct mbinfo *)malloc(110*sizeof(struct mbinfo)); 
blocks = (int (*)[64])malloc(l 10*block_count*sizeof(int [64])); 


} 


void putseqO 

{ 

/* this routine assumes (N % M) = 0 */ 
intij.f; 

unsigned int a, b, c; 
char *neworg[3]; 
int loop; 

for (j=0;j<3;j++) 
neworgjj] = auxorgframe[j]; 


/* loop through all frames in encoding/decoding order */ 
do{ 

chan_in_word(&loop,LinkOInput); 
chan_out_word(loop,Link 1 Output); 
chan_out_word(loop,Link20utput); 

if(loop=nframes) break; 

chan_in_word(&pict_struct,LinkOInput); 
chan_out_word(pict_struct,Link 1 Output) , 
chan_out_word(pict_stmct,Link20utput); 

chan_in_word(&frame_pred_dct,LinkOInput); 
chan_out_word(frame_pred_dct,Link 1 Output), 
chan_out_word(frame_pred_dct,Link20utput); 

chan_in_message(width*height,neworg[0],LinkOInput); 
chan_out_message(width*height,neworg[0],LinklOutput); 
chan_out_message(width*height,ne\vorg[0],Link2 Output); 

chan_in_message(chrom_width*chrom_height,neworg[ 1 ],LinkOInput); 
chan_out_message(chrom_width*chrom_height,neworg[ 1 ] .Link 1 Output); 
chan_out_message(chrom_width*chrom_height,neworg[ 1 ],Link20utput); 

chan_in_message(chrom_\vidth*chrom_height,neworg[2],LinkOInput); 
chan_out_message(chrom_width*chrom_height,neworg[2],LinklOutput); 
chan_out_message(chrom_width*chrom_height,neworg[2],Link2 Output); 


chanin _message(width*height,predfTame[0],LinkOInput); 

chan_out_message(width*height,predframe[0],LinklOutput), 

chan_out_message(width*height,predframe[0],Link2Output), 

chan_in_message(chrom_width*chrom_height,predframe[l],LinkOInput), 

chan_out_message(chrom_width*chrom_height,predfraine[l],LinklOutput); 

chan_out_message(chrom_width*chrom_height,predframe[l],Link20utput); 

chan_in_message(chrom_width*chrom_height,predframe[2],LinkOInput); 

chan_out_message(chrom_width*chrom_height,predframe[2],LinklOutput); 

chan_out_message(chrom_width*chrom_height,predframe[2],Link20utput); 

/* loop through 36 macroblocks of the picture */ 

transform(predframe,neworg,mbinfo, blocks, 1 ); 

for (a=36; a<73; a++) 

{ 

chan_in_word(&mbinfo[a] dct_type,Link 1 Input); 

for (b=0; b<block_count; b++) 
for (c=0; c<64; C++) 

chan_in_\vord(&blocks[a*block_count+b] [c], Link 1 Input); 

} 

for (a=73; a<l 10; a++) 

{ 

chan_in_word(&mbinfo[a].dct_type,Link2Input); 

for (b=0; b<block_count; b++) 
for (c=0; c<64; C++) 

chan_in_word(&blocks[a*block_count+b][c],Link2Input); 

} 

for (a=0; a<l 10; a++) 

{ 

chan_out_word(mbinfo[a].dct_type,LinkOOutput); 

for (b=0; b<block_count; b++) 
for (c=0; c<64; C++) 

chan_out_word(blocks [a *block_count+b] (c],LinkOOutput); 

} 

}while(loop<nframes); 

chaninwordf&loop.Link 1 Input); 
chan_in_word(&loop,Link21nput); 
chan_out_word(loop,LinkOOutput); 

}/*end of main()*/ 

void dct_type_estimation _ANSl_ARGS_((char *pred, 
char *cur, 

struct mbinfo *mbi)); 


static void add_sub_pred _ANSI_ARGS_((char *pred, 
char *cur, 
int lx, int *blk, 
char add_sub)); 

/* subtract prediction and transform prediction error */ 

/* inverse transform prediction error and add prediction */ 

void transform(char *pred[], 
char *cur[], 
struct mbinfo *mbi, 
int blocks[][64], 
char forward invers ) 

{ 

int i, j, il ,jl, n, cc, offs, lx; 
int k, ii, jj; 

if (forward_invers) 

dct_type_estimation(pred[0], cur[0], mbi ) ; 

/* subtract prediction and transform prediction error */ 

/* inverse transform prediction error and add prediction */ 

I* loop through 36 macroblocks of the picture *1 

for(k=0; k<36; k++) 

{ 

jj = (k+146)»4; 
ii = (k+146) - (ij«4); 
j=jj«4; 
i = ii«4; 

for (n=0; n<block_count; n++) 

{ 

cc = (n<4) ? 0 : (n&l)+l ; /* color component index *1 
if (cc=0) 

{ 

/* luminance */ 

if ((pict struct=FRAME PICTURE) && mbi[k].dct_type) 

{ 

/* field DCT */ 

offs = i + ((n&l)«3) + width*(j+((n&2)»l)); 
lx = width«l; 

} 

else 

{ 

/* frame DCT */ 

offs = i + ((n&l)«3) + \vidth2*(j+((n&2)«2)); 
lx = width2; 

} 

if (pict_struct=BOTTOM_FIELD) 
offs += width; 

} 


i 


else 

{ 

/* chrominance */ 

/* scale coordinates */ 

il = (chroma_format=CHROMA444) ? i : i»l, 
jl = (chroma_format ,= CHROMA420) ?j : j»l; 

if ((pict_struct=FRAME_PICTURE) && mbi[k].dct_type 
&& (chroma_format!=CHROMA420)) 

{ 

/* field DCT */ 

offs = il + (n&8) + chrom_width*(j 1 +((n&2)» 1 )), 
lx = chrom_width«l; 

} 

else 

{ 

/* frame DCT •/ 

offs = il + (n&8) + chrom_width2*(jl+((n&2)«2)); 
lx = chrom_width2; 

} 

if (pict_struct=BOTTOM_FIELD) 
offs += chromwidth; 

} 

if (forward_invers) /* 1 for forward DCT */ 

{ 

add_sub_pred(pred[cc]+offs, 

cur[cc]+o£fs, 

lx, 

blocks[k*block_count+n], 

0 ); 

fdct(blocks[k*block_count+n]); 

} 

else /* 0 for invers DCT */ 

{ 

/* idct(blocks[k*b!ock_count+n]);*/ 
add_sub_pred(pred[cc]+ofifs, 
c ur[cc]+oflfs, 
lx, 

blocks[k*block_count+n], 

i); 

} 

} 

} 

} 



/* add prediction and prediction enor, saturate to 0...255 V 
/* subtract prediction from block data */ 
static void add_sub_pred(char *pred, 


char *cur, 
int lx, int *blk, 
char add sub ) 


{ 

char i, j; 


for 0=O;j<8;j++) 

{ 

for (i=0; i<8; i++) 

{ 

if (addsub) /* 1 for addition */ 
cur[i] = clp[ blk[i] + pred[i] ]; 
else I* 0 for subtractction */ 
blk[i] = cur[i] - pred[i]; 

} 

blk+= 8; 
cur+= lx; 
pred+= lx; 

} 


void dct_type_estimation(char *pred, 
char *cur, 

struct mbinfo *mbi ) 

{ 

int blk0[128], blkl[128], 
int i, j, iO, jO, offs; 
double sO, si, sqO, sql, sOl; 
double d, r; 
int k, ii,ij; 

for(k=0; k<36; k++) 

{ 

jj = (k+146)»4; 
ii = (k+146) - (jj«4); 
jO =jj«4; 
iO = ii«4; 

if (frame_pred_dct || pict struct ! =FRAME_PICTURE) 
mbi[k].dct_type = 0; 
else 
{ 

for 

{ 

offs = vvidth*((j << l)+jO) + iO; 
for (i=0; K16; i++) 

{ 

blkO[ 16*j+ij = cur(offs] - pred[ofifs]; 
blkl[16*j+i] = cur[offs+width] - pred[offs+vvidth]; 
offs++, 

} 

} 


/* correlate fields */ 
sO=s 1 =sqO=sq 1 =s0 1=0; 


for (i=0; K128; i++) 

{ 

s0+= (double) blkO[i]; 
sqO+= (double) (blk0[i]*blk0[i]), 
sl+= (double) blkl[i]; 
sql+= (double) (blkl[i]*blkl[i]); 
s01+= (double) (blk0[i]*blkl[i]); 

} 

sO = 0.125 * sO; 
si =0.125 * si; 

d = (sq0-(s0*s0)/2.0)*(sql-(sl*sl)/2.0) ) 

if (d>0.0) 

{ 

r = (s01-(s0*sl)/2.0)/sqrt(d); 
if (r>0.5) 

mbi[k].dct_type = 0; /* frame DCT */ 
else 

mbi[k].dct_type =' 1; /* field DCT *1 

} 

else 

mbi[k].dct_type = 1; (* field DCT */ 

} 

} 

} 


#ifndef PI 

# ifdef M_PI 

# define PI M_PI 

# else 

# define PI 3.14159265358979323846 

# endif 
#endif 


static double c[8][8]; /* transform coefficients */ 

void init_fdct() 

{ 

char i, j; 
double s; 

for (i=0; i<8; i++) 

{ 

s = (i=0) ? sqrt(0. 125) : 0.5; 

for (j=0; j<8; j++) 

c[i][j] = s * cos((PI/8.0)*i*0+0-5)); 

} 


} 


void fdct(int *block) 

{ 

char i, j, k; 
double s; 
double tmp[64]; 

for (i=0; i<8, i++) 
for (j=0;j<8;j-H-) 

{ 

s = 0.0; 

for (k=0; k<8; k++) 
s += c[j][k] * block[8*i+k]; 


tmp[8*i+j] = s; 

} 

for (j=0; j<8; j++) 
for (i=0; i<8; i++) 

{ 

s = 0.0; 

for (k=0; k<8; k++) 
s += cfijfkj * tmp[8*k+j]; 

block[8*i+j] = (int)floor(s+0.499999); 


} 


} 


/********************************/ 

/* */ 

/* t021.c residing at PE5 */ 

/* */ 

/********************************/ 

^include "c:\tc2v2\dos.h" 

#include "c:\tc2v2\float.h" 

#include "c:\tc2v2\math.h" 
include "c:\tc2v2\ctype.h" 

^include "c:\tc2v2\stdio.h" 

#include "c:\tc2v2\stdlib.h" 

#include ”c:\tc2v2\stnng.h" 
tfinclude "c:\tc2v2\chan.h" 

#include "c:\tc2v2\thread.h" 

#include "c \tc2v2\time.h" 

#mclude "c:\tc2v2\ermo.h" 

#deftne GLOBAL /* used by global.h */ 

#include "c:\nick\h\constl.h" 

#include "c:\nick\h\config.h" 

^include "c:\nick\h\global.h" 

static void init _ANSI_ARGS_((void)); 

static void readparmfile _ANSI_ARGS_((char *fname)); 

void main(argc,argv,envp,ins,in_ports,outs,out_ports) 
int argc, ins, outs; 
char *argv[],*envp[]; 

CHAN *in_ports[),*out_ports[]; 

{ 

/* read parameter file */ 
chan_in_word(&M,LinkOInput); 
chan_in_word(&nframes,LinkOInput); 
chan_in_word(&chroma_format,LinkOInput); 

init(); 

putseqO; 


static void inh() 

{ 

int i, size; 
init_fdct(); 

chan_in_word(&mb_vvidth,LinkOInput); 
chanin _word(&mb_height,LinkOInput); 
chan_in_word(&mb_height2,LinkOInput); 
chan_in_word(&width,LinkOInput); 
chan_in_word(&height,LinkOInput); 


chan_in_word(&width2,LinkOInput); 

chan_in_word(&height2,LinkOInput); 

chan_in_word(&chrom_width,LinkOInput); 

chan_in_word(&chrom_height,LinkOInput); 

chan_in_word(&chrom_width2,LinkOInput); 

chan_in_word(&block_count,LinkOInput); 

/* clip table */ 

clp = (char *)malloc(1024); 

clp+= 384; 

for (i=-384; i<640; i++) 
clp[i] = (KO) ? 0 : ((i>255) ? 255 : l); 


for (i=0; i<3; i++) 

{ 

size = (i=0) ? width*height : chrom_width' ,, chrom_height; 

auxorgframe[i] = (char *)malloc(size); 
predffame[i] = (char *)malloc(size); 

} 

mbinfo = (struct mbinfo *)malloc(37*sizeof(struct mbinfo)); 
blocks = (int (*)[64])malloc(37*block_count*sizeof(int [64])); 


void putseq() 

{ 

/* this routine assumes (N % M) == 0 */ 
int i, j, f; 

unsigned int a, b, c; 
char *neworg[3]; 
int loop; 

for (j=0; j<3; j++) 
neworg[j] = auxorgframe[j]; 


/* loop through all frames in encoding/decoding order */ 
do{ 

chan_in_word(&loop,LinkOInput); 

if(loop=nframes) break; 

chan_in_word(&pict_struct,LinkOInput); 

chan_in_word(&frame_pred_dct,LinkOInput); 

chan_in_message(width*height,neworg[0],LinkOInput); 

chan_in_message(chrom_width*chrom_height,neworg[ 1 ],LinkOInput); 

chan_inmessage(chrom_width*chrom_height,neworg[2],LinkOInput); 

chan_in_message(width*height,predffame[0],LinkOInput); 

chan_in_message(chrom_width*chrom_height,predframe[l],LinkOInput); 

chan_in_message(chrom_width*chrom_height,predframe[2],LinkOInput); 


/* loop through 37 macroblocks of the picture */ 


transform(predframe,neworg, mbinfo, blocks, 1 ); 

for (a=0; a<37; a++) 

{ 

chan_out_word(mbinfo[a].dct_type,LinkOOutput); 

for (b=0; b<block_count; b++) 
for (c=0; c<64, C++) 

chan_out_word(blocks[a*block_count+b][c],LinkOOutput); 

} 

}while(loop<nffames), 

chan out word(loop,LinkOOutput), 

}/*end of main()*/ 

void dct_type_estimation _ANSI_ARGS_((char *pred, 
char *cur, 

struct mbinfo *mbi)); 

static void add subjred _ANSI_ARGS_((char *pred, 
char *cur, 
int lx, int *blk, 
char add_sub)); 

/* subtract prediction and transform prediction error */ 

I* inverse transform prediction error and add prediction */ 

void transform(char "'pred[], 
char *cur[], 
struct mbinfo *mbi, 
int blocks[][64], 
char for\vard_invers ) 

{ 

int i, j, il, jl, n, cc, offs, lx; 
int k, ii,jj; 

if (forwardinvers) 

dct_type_estimation(pred[0], cur[0], mbi ) ; 

/* subtract prediction and transform prediction error */ 

/* inverse transform prediction error and add prediction */ 

f* loop through 37 macroblocks of the picture */ 

for(k=0; k<37; k++) 

{ 

jj = (k+182)»4, 
ii = (k+182) - (ij«4), 
j =Ju«4; 
i = ii«4; 


for (n=0; n<block_count, n++) 

{ 

cc = (n<4) ? 0 : (n&l)+l; /* color component index */ 
if (cc==0) 

{ 

/* luminance */ 

if ((pict_struct=FRAME_PICTURE) && mbi[k].dct_type) 

{ 

/* field DCT */ 

offs = i + ((n&l)«3) + width*(j+((n&2)» 1 )); 
lx = width«l; 

} 

else 

{ 

/* frame DCT */ 

offs = i + ((n&l)«3) + width2*(j+((n&2)«2)); 
lx = \vidth2; 

} 

if (pict_stmct=BOTTOM_FIELD) 
offs += width; 

} 

else 

{ 

/* chrominance */ 

/* scale coordinates */ 

il = (chroma_format=CHROMA444) ? i : i»I, 
jl = (chroma_format!=CHROMA420) ?j : j»l; 

if ((pictstruct— FRAMEPICTURE) && mbi[k].dct_type 
&& (chroma_format!=CHROMA420)) 

{ 

/* field DCT */ 

offs = il + (n&8) + chrom_width*(jl+((n&2)»l)); 
lx = chrom_width«l; 

} 

else 

{ 

I* frame DCT */ 

offs = il + (n&8) + chrom_width2*(j]+((n&2)«2)); 
lx = chrom_width2; 

} 

if (pict_struct— BOTTOMFIELD) 
offs += chrom_width; 

} 

if (forward_invers) /* 1 for forward DCT */ 

{ 

add_sub_pred(pred[cc]+offs, 

cur[cc]+offs, 

lx, 


blocks[k*block_count+n], 

0 ); 


fdct(blocks[k*block_count+n]), 

} 

else /* 0 for invers DCT */ 

{ 

/* idct(blocks[k*block_count+n]);*/ 
add_sub_pred(pred[cc]+ofifs, 
cur[cc]+offs, 
lx, 

blocks[k*block_count+n], 

i); 

} 

} 

} 

} 


/* add prediction and prediction error, saturate to 0...2SS */ 
/* subtract prediction from block data */ 
static void add_sub_pred(char *pred, 
char *cur, 
int lx, int *blk, 
char add_sub ) 

{ 

char i, j; 

for (j=o;j < 8;j- H -) 

{ 

for (i=0; i<8; i++) 

{ 

if (add_sub) /* 1 for addition */ 
cur[i] = clp[ blk[i] + pred[i] ]; 
else /* 0 for subtractction */ 
blk[i] = cur[i] - pred[i]; 

} 

blk+= 8; 
cur+= lx; 
pred+= lx; 

} 

} 


void dct_type_estimation(char *pred, 
char *cur, 

struct mbinfo *mbi ) 

{ 

int blk0[128], blkl [128]; 
int i, j, iO, jO, offs; 
double sO, si, sqO, sql, sOl; 
double d, r; 


mt k, ii.jj, 


for(k=0, k<37; k++) 

{ 

jj = (k+182)»4; 
ii = (k+182) - (jj«4); 
j0=jj«4; 
iO = ii«4; 

if (frame_pred_dct || pict_stnict!=FRAME_PICTURE) 
mbi[k].dct_type = 0; 
else 
{ 

for 0=0;j<8;j++) 

{ 

offs = Width*((j«l)+j0) + iO; 
for (i=0; i<16; i++) 

{ 

blk0[16*j+i] = cur(ofls) - pred[offs]; 
blkl[16*j+i] = cur[oflfs+width] - pred[ofifs+width], 
offs++; 

} 

} 

/* correlate fields */ 
sO=s 1 =sqO=sq 1 =s0 1 =0; 


for (i=0; i< 1 28; i++) 

{ 

s0+= (double) blkOfi]; 
sqO+= (double) (blkO[i]*blkO[i]); 
sl+= (double) blkl [i]; 
sql+= (double) (blkl [i]*blkl[i]); 
s01+= (double) (blkO[i]*blkl[i]); 

} 

sO = 0.125 * sO; 
si =0.125* si; 

d = (sq0-(s0*s0)/2.0)*(sql-(sl*sl)/2.0); 

if (d>0.0) 

{ 

r = (s01-(s0*sl)/2 0)/sqrt(d); 
if (r>0.5) 

mbitk].dct_type = 0; /* frame DCT */ 
else 

mbi[k].dct type = 1; /* field DCT */ 

} 

else 

mbi[k].dct_type = I; /* field DCT */ 

} 

} 


#ifndef PI 

# ifdefMPI 

# define PI M_PI 

# else 

# define PI 3.14159265358979323846 

# endif 
#endif 


static double c[8][8], /* transform coefficients */ 

void init_fdct() 

{ 

char i, j; 
double s; 

for (i=0; i<8; i++) 

{ 

s = (i=0) ? sqrt(0.125) : 0.5; 

for (j=0;j<8;j++) 

c[i]Jj] = s * cos((PI/8.0)*i*(j+0.5)); 

} 

} 

void fdct(int *block) 

{ 

char i, j, k; 
double s; 
double tmp[64]; 

for (i=0, i<8; i++) 
for (j = 0; j<8; j-* - * - ) 

s = 0.0; 

for (k=0; k<8; k++) 
s += c[j][k] * block[8*i+k]; 

tmp[8*i+j] = s; 

} 

for (j=0;j < 8;j ++ ) 
for (i=0; i<8; i++) 

{ 

s = 0.0; 

for (k=0; k<8; k-H-) 
s += c[i][k] * tmp[8*k+j]; 

block[8*i+j] = (int)floor(s-K).499999); 


} 


} 


/********************************/ 

/* */ 

/* t022 c residing at PE6 */ 

/* */ 

/********************************/ 

#include "c:\tc2v2\dos.h" 

#include "c:\tc2v2\float.h" 

#include "c:\tc2v2\math.h" 

#include "c:\tc2v2\ctype.h" 

#include "c:\tc2v2\stdio.h" 

#include "c:\tc2v2\stdlib.h" 

#include "c:\tc2v2\string h" 

#include "c:\tc2v2\chan.h" 

^include "c:\tc2v2\thread.h" 

#include "c:\tc2v2\time.h" 

#include "c:\tc2v2\ermo.h" 

#define GLOBAL /* used by global.h */ 

#include "c:\nick\h\constl.h" 

#include "c:\nick\h\config.h" 

#include "c:\nick\h\global h" 

static void init _ANSI_ARGS_((void)); 

static void readparmfile AN SI_ ARGS_((char *fname)); 

void main(argc,argv,envp,ins,in_ports,outs,out_ports) 
int argc, ins, outs; 
char *argv[],*envp[]; 

CHAN *in_ports[],*outjx)rts[]; 

{ 

/* read parameter file */ 
chan_in_word(&M,LinkOInput); 
chan_in_word(&nframes,LinkOInput); 
chanin _word(&chroma_format,LinkOInput); 

imt(); 

putseq(); 


} 


static void init() 

{ 

int i, size; 
imt_fdct(), 

chan_in_word(&mb_width,Link01nput); 

chan_in_word(&mb_height,LinkOInput); 

chan_in_word(&mb_height2,Link01nput); 

chan_in_word(&width,LinkOInput); 

chan_in_word(&height,LinkOInput); 



chan_in_word(&width2,LinkOInput); 

chan_in_word(&height2,LinkOInput), 

chan_in_word(&chrom_width,LinkOInput); 

chan_in_word(&chrom_height,LinkOInput); 

chan_in_word(&chrom_width2,LinkOInput), 

chan in word(&block_count,LinkOInput); 

/* clip table */ 

clp = (char *)malloc(1024); 

clp+= 384; 

for (i=-384; i<640; i++) 
clp[i] = (i<0) ? 0 : ((i>255) ? 255 : i); 


for (i=0; i<3; i++) 

{ 

size = (i=0) ? width*height : chrom_width*chrom_height; 

auxorgframe[i] = (char *)malloc(size); 
predframe[i] = (char *)malloc(size); 

} 

mbinfo = (struct mbinfo *)malloc(37*sizeof(struct mbinfo)); 
blocks = (int (*)[64])malloc(37*block_count*sizeof(int [64])); 


void putseqO 

{ 

/* this routine assumes (N % M) = 0 */ 
int i, j, f; 

unsigned int a, b, c; 
char *neworg[3]; 
int loop; 

for 0=O;j < 3;j ++ ) 
neworglj] = auxorgframe[j]; 


/* loop through all frames in encoding/decoding order */ 
do{ 

chan_in_word(&loop,LinkOInput), 

if(loop=nframes) break; 

chan_in_word(&pict_struct,LinkOInput); 

chan_in_word(&frame_pred_dct,LinkOInput); 

chan_in_message(width*height,neworg[0],Link0Input); 

chan_in_message(chrom_\vidth*chrom_height,neworg[l],LinkOInput); 

chan_in_message(chrom_\vidth*chrom_height,neworg[2],LinkOInput); 

chan_in_message(width*height,predframe[0],LinkOInput); 

chan_in_message(chrom_width*chrom_height,predframe[l],LinkOInput); 

chan_in_message(chrom_width*chrom_height,predframe[2],LinkOInput); 


/* loop through 37 macroblocks of the picture V 

transform(predffame, neworg, mbinfo, blocks, 1); 

for (a=0, a<37; a++) 

{ 

chan_out_word(mbinfo[a].dct_type,LinkOOutput); 

for (b=0, b<block_count; b++) 
for (c=0; c<64; C++) 

chan out word(blocks[a*block_count+b] [c],LinkOOutput); 

} 

}while(loop<nframes); 

chan_out_\vord(loop,LinkOOutput); 

}/*end of main()*/ 

void dct_type_estimation _ANSI_ARGS_((char *pred, 
char *cur, 

struct mbinfo *mbi)); 

static void add_sub_pred _ANSI_ARGS_((char *pred, 
char *cur, 
int lx, int *blk, 
char add_sub)); 

void transform(char *pred[], 
char *cur[], 
struct mbinfo *mbi, 
int blocks [][64], 
char forwardinvers ) 

{ 

int i, j, il , jl, n, cc, offs, lx; 
int k, ii, jj; 

if (forwardjnvers) 

dct_type_estimation(pred[0], cur[0], mbi ) ; 

/* subtract prediction and transform prediction error */ 

/* inverse transform prediction error and add predicUon */ 

/* loop through 37 macroblocks of the picture */ 

for(k=0; k<37; k++) 

{ 

jj = (k+219)»4; 
ii = (k+219) - (jj«4); 
j =jj«4; 
i = ii«4. 


for (n=0; n<block_count; n++) 

{ 


cc = (n<4) ? 0 : (n&l)+l; /* color component index */ 
if (cc=0) 

{ 

/* luminance *1 

if ((pict_struct=FRAME_PICTURE) && mbi[k].dct_type) 

{ 

/* field DCT *1 

offs = i + ((n&l)«3) + width*(j+((n&2)» 1 )); 
lx = width«l, 

} 

else 

{ 

/* frame DCT */ 

offs = i + ((n&l)«3) + width2*(j+((n&2)«2)); 
lx = width2; 

} 

if (pict_struct==BOTTOM_FIELD) 
offs += width; 

} 

else 

{ 

/* chrominance */ 

/* scale coordinates */ 

il = (chroma_format=CHROMA444) ? i : i»l; 
jl = (chroma format ! =CHROMA420) ? j : j»l; 

if ((pict_struct=FRAME_PICTURE) && mbi[k].dct_type 
&& (chroma_format!=CHROMA420)) 

{ 

/* field DCT */ 

offs = il + (n&8) + chrom_width*(jl+((n&2)»l)); 
lx = chrom_width«l; 

} 

else 

{ 

/* frame DCT */ 

offs = il + (n&8) + chrom_width2*(jl + ((n&2)«2)); 
lx = chrom_width2; 

} 

if (pict_struct=BOTTOM_FIELD) 
offs += chromwidth; 

} 

if (forward invers) t* 1 for forward DCT */ 

{ 

add_sub_pred(pred[cc]+offs, 

cur[cc]+offs, 

lx, 

blocks[k*block_count+n], 

0); 


fdct(blocks{k*block_count+n]); 

} 

else /* 0 for invers DCT */ 


{ 

/* idct(blocks[k*block_count+n]);*/ 
add_sub_pred(pred[cc]+offs, 
cur[cc]+offs, 
lx, 

blocks[k*block_count+n], 

1 ); 

} 


} 


} 

} 


/* add prediction and prediction error, saturate to 0...255 */ 
/* subtract prediction from block data */ 
static void add_sub_pred(char *pred, 
char *cur, 
int lx, int *blk, 
char add_sub ) 

{ 

char i,j; 

for (j=0; j<8;j++) 

{ 

for (i=0; i<8; i++) 

{ 

if (add_sub) /* 1 for addition */ 
cur[i] = clp[ blk[i] + pred[i] ]; 
else /* 0 for subtractction */ 
blk[i] = cur[i] - pred[i]; 

} 

blk+= 8, 
cur+= lx; 
pred+= lx; 

} 

} 


void dct_type_estimation(char *pred, 
char *cur, 

struct mbinfo *mbi ) 

{ 

int blkO[ 128], blkl [ 128]; 
int i, j, iO, jO, offs; 
double sO, si, sqO, sql, sOl; 
double d, r; 
int k, ii,jj; 


for(k=0, k<37; k++) 


{ 

jj = (k+219)»4; 
ii = (k+219) - (u«4); 
jO = jj«4; 
iO = ii«4; 

if (frame_pred_dct [| pict_struct!=FRAME_PICTURE) 
mbi[k].dct_type = 0; 
else 
{ 

for G=0;j<8;j++) 

{ 

offs = width*((j«l)+jO) + iO; 
for (i=0; i<16; i++) 

{ 

blk0[16*j+i] = cur[offs] - predfoffs]; 
blkl[16*j+i] = cur[oflfs+width] - pred[oflfs+width]; 
offs++; 

} 

} 

/* correlate fields */ 
s0=s 1 =sq0=sq 1 =s0 1=0; 


for (i=0; i<128; i++) 

{ 

s0+= (double) blk0[ij; 
sqO+= (double) (blkO[i]*blkO[i]); 
sl+= (double) blkl [i]; 
sql+= (double) (blkl[i]*blkl[i)); 
s01+= (double) (blkO[i]*blkl[i]); 

} 

s0 = 0.125 * sO; 
si =0.125 * si; 

d = (sq0-(s0*s0)/2.0)*(sq 1 -(s 1 *s 1 )/2.0), 

if (d>0.0) 

{ 

r = (s01-(s0*sl)/2.0)/sqrt(d); 
if (r>0.5) 

mbi[k].dct_type = 0; /* frame DCT */ 
else 

mbi[k].dct_type = 1; /* field DCT */ 

} 

else 

mbi[k].dct_type = 1; /* field DCT */ 

} 

} 


#ifndefPI 

# ifdef M_PI 

# define PI M PI 


# else 

# define PI 3.14159265358979323846 

# endif 
#endif 


static double c[8][8]; /* transform coefficients */ 

void init_fdct() 

{ 

char i.j; 
double s; 


for (i=0; i<8; i++) 

{ 

s = (i=0)?sqrt(0.125) : 0.5; 

for (j=0;j<8;j-H-) 

c[i][j] = s * cos((PI/8.0)*i*(j+0.5)); 

} 

} 

void fdct(int *block) 

{ 

char i, j, k; 
double s; 
double tmp[64]; 

for (i=0; i<8; i++) 
for (j=0; j<8; j++) 

{ 

s = 0.0; 

for (k=0; k<8; k++) 
s += c[j][k] * block[8*i+k]; 


tmp[8*i+j] = s; 

} 

for (j=0;j < 8;j++) 
for (i=0; i<8; i++) 

{ 

s = 0.0; 

for (k=0; k<8; k++) 
s += c[i][k] * tmp[8*k+j]; 


} 


} 


block[8*i+j] = (int)floor(s+0.499999); 


